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(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 



AAAACCCCAA 


AACCCCAAAA 


CCCCTTTTAG 


AGCCCTGCAG 


TTGGAAATAT 


AACCTCAGTA 


60 


TTAATAAGCT 


CAGATTTTAA 


ATATTAATTA 


CAAAACCTAA 


ATGGAGGTTG 


ATGTTGATAA 


120 


TCAAGCTGAT 


AATCATGGCA 


TTCACTCAGC 


TCTTAAGACT 


TGTGAAGAAA 


TTAAAGAAGC 


180 


TAAAACGTTG 


TACTCTTGGA 


TCCAGAAAGT 


TATTAGATGA 


AGAAATCAAT 


CTCAAAGTCA 


240 


TTATAAAGAT 


TTAGAAGATA 


TTAAAATATT 


TGCGCAGACA 


AATATTGTTG 


CTACTCCACG 


300 


AGACTATAAT 


GAAGAAGATT 


TTAAAGTTAT 


TGCAAGAAAA 


GAAGTATTTT 


CAACTGGACT 


360 


AATGATCGAA 


CTTATTGACA 


AATGCTTAGT 


TGAACTTCTT 


TCATCAAGCG 


ATGTTTCAGA 


420 


TAGACAAAAA 


CTT CAATGAT 


TTGGATTTCA 


ACTTAAGGGA 


AATCAATTAG 


CAAAGACCCA 


480 


TTTATTAACA 


GCTCTTTCAA 


CTCAAAAGCA 


GTATTTCTTT 


CAAGACGAAT 


GGAACCAAGT 


540 


TAGAGCAATG 


ATTGGAAATG 


AGCTCTTCCG 


ACATCTCTAC 


ACTAAATATT 


TAATATTCCA 


600 


GCGAACTTCT 


GAAGGAACTC 


TTGTTCAATT 


TTGCGGGAAT 


AACGTTTTTG 


ATCATTTGAA 


660 


AGTCAACGAT 


AAGTTTGACA 


AAAAGCAAAA 


AGGTGGAGCA 


GCAGACATGA 


ATGAACCTCG 


720 


ATGTTGATCA 


ACCTGCAAAT 


ACAATGTCAA 


GAATGAGAAA 


GATCACTTTC 


TCAACAACAT 


780 


CAACGTGCCG 


AATTGGAATA 


ATATGAAATC 


AAGAACCAGA 


ATATTTTATT 


GCACTCATTT 


840 


TAATAGAAAT 


AAC CAATTCT 


TCAAAAAGCA 


TGAGTTTGTG 


AGTAACAAAA 


ACAATATTTC 


900 


AGCGATGGAC 


AGAGCTCAGA 


CGATATTCAC 


GAATATATTC 


AGATTTAATA 


GAATTAGAAA 


960 


GAAGCTAAAA 


GATAAGGTTA 


TCGAAAAAAT 


TGCCTACATG 


CTTGAGAAAG 


TCAAAGATTT 


1020 


TAACTTCAAC 


TACTATTTAA 


CAAAATCTTG 


TCCTCTTCCA 


GAAAATTGGC 


GGGAACGGAA 


1080 


ACAAAAAATC 


GAAAACTTGA 


TAAATAAAAC 


TAGAGAAGAA 


AAGTCGAAGT 


ACTATGAAGA 


1140 


GCTGTTTAGC 


TACACAACTG 


ATAATAAATG 


CGTCACACAA 


TTTATTAATG 


AATTTTTCTA 


1200 


CAATATACTC 


CCCAAAGACT 


TTTTGACTGG 


AAGAAAC GGT 


AAGAATTTTC 


AAAAGAAAGT 


1260 


TAAGAAATAT 


GTGGAACTAA 


ACAAGCATGA 


ACTCATTCAC 


AAAAACTTAT 


TGCTTGAGAA 


1320 


GATCAATACA 


AGAGAAATAT 


CATGGATGCA 


GGTTGAGACC 


TCTGCAAAGC 


ATTTTTATTA 


1380 


TTTTGATCAC 


GAAAACATCT 


ACGTCTTATG 


GAAATTGCTC 


CGATGGATAT 


TCGAGGATCT 


1440 


CGTCGTCTCG 


CTGATTAGAT 


GATTTTTCTA 


TGTCAC CGAG 


CAACAGAAAA 


GTTACTCCAA 


1500 


AAC CTATTAC 


TACAGAAAGA 


ATATTTGGGA 


CGTCATTATG 


AAAATGTCAA 


TCGCAGACTT 


1560 


AAAGAAGGAA 


ACGCTTGCTG 


AGGTCCAAGA 


AAAAGAGGTT 


GAAGAATGGA 


AAAAGTCGCT 


1620 


TGGATTTGCA 


CCTGGAAAAC 


TCAGACTAAT 


ACCGAAGAAA 


ACTACTTTCC 


GTCCAATTAT 


1680 


GACTTTCAAT 


AAGAAGATTG 


TAAATTCAGA 


CCGGAAGACT 


ACAAAATTAA 


CTACAAATAC 


1740 
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/**tv "A /"imrn "A mm /"i 

GAAGTTATTG 


-n y, yirpyirrtyi7\ y»rp 

AACT C TC ACT 


TAATGCTTAA 


yi -n /-I T\ nirr>s~i a 7\ r* 

GAC AT TG AAG 


7A TV rri TV y^* TV 7V rp/-1 PTt 

AAT AGAA I Cr 1 


rprp7\ TV T\ C* 7\ r P/ r ^/ r ~ 1 


1 OAfl 

lt3 UO 


TTTTGGATTC 


GCTGTTTTTA 


71 nmii rpyi 71 rpyi TV, 

AC 1 A 1 G A I G A 


mnm-fi t\ rriO AAA 

1 (a 1 AA 1 G AAA 


AACr I A 1 bAub 


Ab XXXwXXXva 


1 q a a 


yA 7\ 7\ my* /** TV TV /*i 

CAAATGGAAG 


y* TV t\ yi rprriy^i yi 71 yt 

CAAG r X GGAC 


AAi^/^AAAA /""T 1 

AACCAAAAC X 


Ll 1 CI X XGCA 


"A /T"P TV rp/^l 7\ tp TV 

ACTAlvjtaAlA 


TTT* A A A A fTP 

X LbAfWib X 


1 q 0 n 


•j\ m 7V rpyi 7\ rp t\ ytrn 

ATATGATAGT 


GTAAACAGAG 


A A A A A /"^T' A T 1 /^ 1 

AAAAACXAXC 


AALAi X CC I A 


7\ tv TV 7,piTi7\ y^Ti 7\ 

AAAAC 1 AC 1 A 


A A T""P A 1 11 1 11 1 iri 

AAX X AL-X 1 1L 


1 QQA 

xyou 


TTCAGATTTC 


myt /*i *a mm 7\ rnyi tv 

TGGATTATGA 


yirTIyi yi 7\ ytTV A 7\ rri 

CTGCACAAAT 


rp/^irn 7V 7V A /"* A fi 7V 

T CTAAAGAG A 


TV TV fl 7\ 7A T^TV 7A 7A 

AAGAA I AAC A 


XAGX XAX CCjA 




TTCGAAAAAC 


mmm tv ^* tv. tv tv t\ 

TTTAGAAAGA 


yt "Ti yi TV TV "A m/"t 7\ TV 

AAGAAATGAA 


AGAT T ATTT T 


AG AC ACjAAA i 


X C C AGAAGA X 




TGCACTTGAA 


GGAGGACAAT 


■a m yi yi *a a ytyirrirTi 

ATCCAACCTT 


ATTCAGTGTT 


i^imm^ tv tv tv 7v T»^** 

CTTGAAAATG 


A A fi A A A A *T»/"t TV 

AACAAAATGA 




CTTAAATGCA 


IV TV y— * TV H TV TV TV m 

AAGAAAACAT 


TAATTGTTGA 


TV yt yi 7\ TV TV /"i y* TV TV 

AG C AAAG C AA 


TV /"* TV TV 7V Tlrn 7V fprrl 

AGAAA 1 TAT I 


n"i»A a r* a a a r* a 
X X AAbAAAbA 


0 0 0 r\ 


TAACTTACTT 


CAACCAGTCA 


TTAATATTTG 


ytynv "a m t\ rn A tv rn 

CCAATATAAT 


TACAT 1 AAC 1 


X X AA X GGG AA 




GTTTTATAAA 


CAAACAAAAG 


yt-A a mm.i/im/*) a 

GAATTCCTCA 


7* yiyi rnyrprpTiyl 7\ 

AGGTCTTTGA 


G TTT CAT C AA 


X X X XGX CAX C 


0 1 a rs 


ATTTTATTAT 


GCAACATTAG 


AGGAAAGCTC 


CTTAGGATTC 


CTTAGAG A X G 


AA X CAAX GAA 


0 /i n a 
2400 


CCCTGAAAAT 


C CAAATGTT A 


ATCTTCTAAT 


GAGACTTACA 


GATGACTATC 


TTTTGATTAC 


24o 0 


AACTCAAGAG 


AATAATGCAG 


TATTGTTTAT 


TGAGAAACTT 


*n m "TV TV TV /*T FT* TV TV 

ATAAACGTAA 


/trn^rirfin A TV A TV 

GTCGTGAAAA 


O CT O A 

2 52 0 


TGGATTTAAA 


TTCAATATGA 


AGAAACTACA 


GACTAGTTTT 


/™t/"» tv mm tv tv /rnn 

CC ATT AAG TC 


r*»A A Or* 1 A A A T'T' 

C AAG C AAA X X 


O C Q A 

2 bo 0 


TGCAAAATAC 


GGAATGGATA 


GTGTTGAGGA 


GCAAAATATT 


rrtm tv TV TV ■ rn 1 1 

GTTGAAGA TT 


ACXGCGAX XG 


2o4 0 


GATTGGCATC 


TCAATTGATA 


TGAAAACTCT 


TG CTT TAATG 


yi*— i7\ t\ tv r-p t\ rrtm t\ 

CCAAATA1 TA 


71 r*rn*vr t a r* A A 1* 
AL X X oAvjtAAX 


0 1 n n 


AGAAGGAATT 


CTGTGTACAC 


rnyiA Ti rnyirri A tv "a 

T C AAT CT AAA 


CATGCAAACA 


AAuAAAuLA X 


AA X X vavjv^ X 




CAAGAAGAAA 


CTAAAGTCGT 


rrirnrrirri tv "A m/^ni 7\ 

TTTTAATGAA 


fln tv tv TV rnfT 1 7\ f~*f~* 

TAAC ATT AC C 


y* TV mm tv rn rnrrtrp -n 

LAJ. 1 Al i 1 1A 


VjAAAVjiAU. VIA X 




TACAACCGAA 


GACTTTGCGA 


a m t\ -a 7v -a ytrri yi m 

ATAAAACT CT 


CAACAAG 1 I A 


rTirrtrpTv rp7\ pp/**" 7\ yt 

X X X AX AX UiU 






TV m a /™i "a rtiyi y* "A 7\ 

ATACATGCAA 


m^l TV yi yl yi * n T\ yi 

TG AG C C AAAG 


7\ 7\rn7\/*l7\ TV yiyi 7V 

AATACAAGGA 






X f% X v3ri\j \^r\\J 


2940 


TATGATCGAC 


TTAGAGGTAT 


CTAAAATTAT 


ATACTCTGTA 


AC C AGAGC AT 


TCTTTAAATA 


3000 


CCTTGTGTGC 


AATATTAAGG 


ATACAATTTT 


TGGAGAGGAG 


CATTATCCAG 


ACTTTTTCCT 


3060 


TAGCACACTG 


AAGCACTTTA 


TTGAAATATT 


CAGCACAAAA 


AAGTAGATTT 


TCAACAGAGT 


312 0 


TTGCATGATC 


CTCAAGGCAA 


AAGAAGCAAA 


GCTAAAAAGT 


GACCAATGTC 


AATCTCTAAT 


3180 


TCAATATGAT 


GCATAGTCGA 


CTATTCTAAC 


TTATTTTGGA 


AAGTTAATTT 


TCAATTTTTG 


3240 


TCTTATATAC 


TGGGGTTTTG 


GGGTTTTGGG 


GTTTTGGGG 






3279 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1031 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

Met Glu Val Asp Val Asp Asn Gin Ala Asp Asn His Gly lie His Ser 
1 5 10 15 

Ala Leu Lys Thr Cys Glu Glu lie Lys Glu Ala Lys Thr Leu Tyr Ser 
20 25 30 

Trp lie Gin Lys Val He Arg Cys Arg Asn Gin Ser Gin Ser His Tyr 
35 40 45 

Lys Asp Leu Glu Asp He Lys He Phe Ala Gin Thr Asn He Val Ala 
50 55 60 

Thr Pro Arg Asp Tyr Asn Glu Glu Asp Phe Lys Val He Ala Arg Lys 
65 70 75 80 

Glu Val Phe Ser Thr Gly Leu Met He Glu Leu He Asp Lys Cys Leu 
85 90 95 

Val Glu Leu Leu Ser Ser Ser Asp Val Ser Asp Arg Gin Lys Leu Gin 
100 105 110 

Cys Phe Gly Phe Gin Leu Lys Gly Asn Gin Leu Ala Lys Thr His Leu 
115 120 125 

Leu Thr Ala Leu Ser Thr Gin Lys Gin Tyr Phe Phe Gin Asp Glu Trp 
130 135 140 

Asn Gin Val Arg Ala Met He Gly Asn Glu Leu Phe Arg His Leu Tyr 
145 150 155 160 

Thr Lys Tyr Leu He Phe Gin Arg Thr Ser Glu Gly Thr Leu Val Gin 
165 170 175 

Phe Cys Gly Asn Asn Val Phe Asp His Leu Lys Val Asn Asp Lys Phe 
180 185 190 

Asp Lys Lys Gin Lys Gly Gly Ala Ala Asp Met Asn Glu Pro Arg Cys 
195 200 205 

Cys Ser Thr Cys Lys Tyr Asn Val Lys Asn Glu Lys Asp His Phe Leu 
210 215 220 

Asn Asn He Asn Val Pro Asn Trp Asn Asn Met Lys Ser Arg Thr Arg 
225 230 235 240 

He Phe Tyr Cys Thr His Phe Asn Arg Asn Asn Gin Phe Phe Lys Lys 
245 250 255 

His Glu Phe Val Ser Asn Lys Asn Asn He Ser Ala Met Asp Arg Ala 
260 265 270 

Gin Thr He Phe Thr Asn He Phe Arg Phe Asn Arg He Arg Lys Lys 
275 280 285 

Leu Lys Asp Lys Val lie Glu Lys He Ala Tyr Met Leu Glu Lys Val 
290 295 300 

Lys Asp Phe Asn Phe Asn Tyr Tyr Leu Thr Lys Ser Cys Pro Leu Pro 
305 310 315 320 
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Glu Asn Trp Arg Glu Arg Lys Gin Lys lie Glu Asn Leu lie Asn Lys 
325 330 335 

Thr Arg Glu Glu Lys Ser Lys Tyr Tyr Glu Glu Leu Phe Ser Tyr Thr 
340 345 350 

Thr Asp Asn Lys Cys Val Thr Gin Phe lie Asn Glu Phe Phe Tyr Asn 
355 360 365 

lie Leu Pro Lys Asp Phe Leu Thr Gly Arg Asn Arg Lys Asn Phe Gin 
370 375 380 

Lys Lys Val Lys Lys Tyr Val Glu Leu Asn Lys His Glu Leu lie His 
385 390 395 400 

Lys Asn Leu Leu Leu Glu Lys He Asn Thr Arg Glu He Ser Trp Met 
405 410 415 

Gin Val Glu Thr Ser Ala Lys His Phe Tyr Tyr Phe Asp His Glu Asn 
420 425 430 

He Tyr Val Leu Trp Lys Leu Leu Arg Trp He Phe Glu Asp Leu Val 
435 440 445 

Val Ser Leu He Arg Cys Phe Phe Tyr Val Thr Glu Gin Gin Lys Ser 
450 455 460 

Tyr Ser Lys Thr Tyr Tyr Tyr Arg Lys Asn He Trp Asp Val He Met 
465 470 475 480 

Lys Met Ser lie Ala Asp Leu Lys Lys Glu Thr Leu Ala Glu Val Gin 
485 490 495 

Glu Lys Glu Val Glu Glu Trp Lys Lys Ser Leu Gly Phe Ala Pro Gly 
500 505 510 

Lys Leu Arg Leu He Pro Lys Lys Thr Thr Phe Arg Pro He Met Thr 
515 520 525 

Phe Asn Lys Lys He Val Asn Ser Asp Arg Lys Thr Thr Lys Leu Thr 
530 . 535 540 

Thr Asn Thr Lys Leu Leu Asn Ser His Leu Met Leu Lys Thr Leu Lys 
545 550 555 560 

Asn Arg Met Phe Lys Asp Pro Phe Gly Phe Ala Val Phe Asn Tyr Asp 
565 570 575 

Asp Val Met Lys Lys Tyr Glu Glu Phe Val Cys Lys Trp Lys Gin Val 
580 585 590 

Gly Gin Pro Lys Leu Phe Phe Ala Thr Met Asp He Glu Lys Cys Tyr 
595 600 605 

Asp Ser Val Asn Arg Glu Lys Leu Ser Thr Phe Leu Lys Thr Thr Lys 
.610 615 620 

Leu Leu Ser Ser Asp Phe Trp He Met Thr Ala Gin He Leu Lys Arg 
625 630 635 640 

Lys Asn Asn He Val He Asp Ser Lys Asn Phe Arg Lys Lys Glu Met 
645 650 655 
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Lys Asp Tyr Phe Arg Gin Lys Phe Gin Lys lie Ala Leu Glu Gly Gly 
660 665 670 

Gin Tyr Pro Thr Leu Phe Ser Val Leu Glu Asn Glu Gin Asn Asp Leu 
675 680 685 

Asn Ala Lys Lys Thr Leu lie Val Glu Ala Lys Gin Arg Asn Tyr Phe 
690 695 700 

Lys Lys Asp Asn Leu Leu Gin Pro Val lie Asn lie Cys Gin Tyr Asn 
705 710 715 720 

Tyr lie Asn Phe Asn Gly Lys Phe Tyr Lys Gin Thr Lys Gly lie Pro 
725 730 735 

Gin Gly Leu Cys Val Ser Ser lie Leu Ser Ser Phe Tyr Tyr Ala Thr 
740 745 750 

Leu Glu Glu Ser Ser Leu Gly Phe Leu Arg Asp Glu Ser Met Asn Pro 
755 760 765 

Glu Asn Pro Asn Val Asn Leu Leu Met Arg Leu Thr Asp Asp Tyr Leu 
770 775 780 

Leu lie Thr Thr Gin Glu Asn Asn Ala Val Leu Phe lie Glu Lys Leu 
785 790 795 800 

lie Asn Val Ser Arg Glu Asn Gly Phe Lys Phe Asn Met Lys Lys Leu 
805 810 815 

Gin Thr Ser Phe Pro Leu Ser Pro Ser Lys Phe Ala Lys Tyr Gly Met 
820 825 830 

Asp Ser Val Glu Glu Gin Asn lie Val Gin Asp Tyr Cys Asp Trp lie 
835 840 845 

Gly lie Ser He Asp Met Lys Thr Leu Ala Leu Met Pro Asn He Asn 
850 855 860 

Leu Arg He Glu Gly He Leu Cys Thr Leu Asn Leu Asn Met Gin Thr 
865 870 875 880 

Lys Lys Ala Ser Met Trp Leu Lys Lys Lys Leu Lys Ser Phe Leu Met 
885 890 895 

Asn Asn He Thr His Tyr Phe Arg Lys Thr He Thr Thr Glu Asp Phe 
900 905 910 

Ala Asn Lys Thr Leu Asn Lys Leu Phe lie Ser Gly Gly Tyr Lys Tyr 
915 920 925 

Met Gin Cys Ala Lys Glu Tyr Lys Asp His Phe Lys Lys Asn Leu Ala 
930 935 940 

Met Ser Ser Met lie Asp Leu Glu Val Ser Lys He He Tyr Ser Val 
945 950 955 960 

Thr Arg Ala Phe Phe Lys Tyr Leu Val Cys Asn He Lys Asp Thr He 
965 970 975. 

Phe Gly Glu Glu His Tyr Pro Asp Phe Phe Leu Ser Thr Leu Lys His 
980 585 990 
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Phe lie Glu lie Phe Ser Thr Lys Lys Tyr lie Phe Asn Arg Val Cys 
995 1000 1005 

Met He Leu Lys Ala Lys Glu Ala Lys Leu Lys Ser Asp Gin Cys Gin 
1010 1015 1020 

Ser Leu He Gin Tyr Asp Ala 
1025 1030 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1762 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 



CCCCAAAACC 


CCAAAACCCC 


AA&ACCCCTA 


TAAAAAAAGA 


AAAAATTGAG 


GTAGTTTAGA 


60 


AATAAAATAT 


TATTCCCGCA 


CAAATGGAGA 


TGGATATTGA 


TTTGGATGAT 


ATAGAAAATT 


120 


TACTTCCTAA 


TACATTCAAC 


AAGTATAGCA 


GCTCTTGTAG 


TGACAAGAAA 


GGATGCAAAA 


180 


CATTGAAATC 


TGGCTCGAAA 


TCGCCTTCAT 


TGACTATTCC 


AAAGTTGCAA 


AAACAATTAG 


24 0 


AGTTCTACTT 


CTCGGATGCA 


AATCTTTATA 


ACGATTCTTT 


CTTGAGAAAA 


TTAGTTTTAA 


300 


AAAGCGGAGA 


GCAAAGAGTA 


GAAATTGAAA 


CATTACTAAT 


GTTTAAATAA 


AATCAGGTAA 


360 


TGAGGATTAT 


TCTATTTTTT 


AGATCACTTC 


TTAAGGAGCA 


TTATGGAGAA 


AATTACTTAA 


420 


TACTAAAAGG 


TAAACAGTTT 


GGATTATTTC 


CCTAGCCAAC 


AATGATGAGT 


ATATTAAATT 


480 


CATATGAGAA 


TGAGTCAAAG 


GATCTCGATA 


CATCAGACTT 


ACCAAAGACA 


AACTCGCTAT 


540 


AAAACGCAAG 


AAAAAGTTTG 


ATAATCGAAC 


AGCAGAAGAA 


CTTATTGCAT 


TTACTATTCG 


600 


TATGGGTTTT 


ATTACAATTG 


TTTTAGGTAT 


CGACGGTGAA 


CTCCCGAGTC 


TTGAGACAAT 


660 


TGAAAAAGCT 


GTTTACAACT 


GAAGGAATCG 


CAGTTCTGAA 


AGTTCTGATG 


TGTATGCCAT 


720 


TATTTTGTGA 


ATTAATCTCA 


AATATCTTAT 


CTCAATTTAA 


TGGATAGCTA 


TAGAAACAAA 


780 


CCAAATAAAC 


CATGCAAGTT 


TAATGGAATA 


TACGTTAAAT 


CCTTTGGGAC 


AAATGCACAC 


840 


TGAATTTATA 


TTGGATTCTT 


AAAGCATAGA 


TACACAGAAT 


GCTTTAGAGA 


CTGATTTAGC 


900 


TTACAACAGA 


TTACCTGTTT 


TGATTACTCT 


TGCTCATCTC 


TTATATCTTT 


AAAAGAAGCA 


960 


GGCGAAATGA 


AAAGAAGACT 


AAAGAAAGAG 


ATTTCAAAAT 


TTGTTGATTC 


TTCTGTAACC 


1020 


GGAATTAACA 


ACAAGAATAT 


TAGCAACGAA 


AAAGAAGAAG 


AGCTATCACA 


ATCCTGATTC 


1080 


TTAAAGATTT 


CAAAAATTCC 


AGGTAAGAGA 


GATACATTCA 


TTAAAATTCA 


TATATTATAG 


1140 


TTTTTCATTT 


CACAGCTGTT 


ATTTTCTTTT 


ATCTTAACAA 


TATTTTTTGA 


TTAGCTGGAA 


1200 
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GTAAAAAGTA 


TCAAATAAGA 


GAAGCGCTAG 


ACTGAGGTAA 


CTTAGCTTAT 


TCACATTCAT 


1260 


AGATCGACCT 


TCATATATCC 


AATACGATGA 


TAAGGAAACA 


GCAGTCATCC 


GTTTTAAAAA 


1320 


TAGTGCTATG 


AGGACTAAAT 


TTTTAGAGTC 


AAGAAATGGA 


GCCGAAATCT 


TAATCAAAAA 


1380 


GAATTGCGTC 


GATATTGCAA 


AAGAATCGAA 


CTCTAAATCT 


TTCGTTAATA 


AGTATTAC C A 


1440 


ATCTTGATTG 


ATTGAAGAGA 


TTGACGAGGC 


AACTGCACAG 


AAGATCATTA 


AAGAAATAAA 


1500 


GTAACTTTTA 


TTAATTAGAG 


AATAAACTAA 


ATTACTAATA 


TAGAGATCAG 


CGATCTTCAA 


1560 


TTGACGAAAT 


AAAAGCTGAA 


CTAAAGTTAG 


ACAATAAAAA 


ATACAAACCT 


TGGTCAAAAT 


1620 


ATTGAGGAAG 


GAAAAGAAGA 


CCAGTTAGCA 


AAAGAAAAAA 


TAAGGCAATA 


AATAAAATGA 


1680 


GTACAGAAGT 


GAAGAAATAA 


AAGATTTATT 


TTTTTCAATA 


ATTTATTGAA 


AAGAGGGGTT 


1740 


TTGGGGTTTT 


GGGGTTTTGG 


GG 








1762 


(2) INFORMATION FOR SEQ ID NO : 4: 











(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 552 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Pro Gin Asn Pro Lys Thr Pro Lys Pro Leu Lys Lys Lys Lys Leu Arg 
1 5 10 15 

Phe Arg Asn Lys lie Leu Phe Pro His Lys Trp Arg Trp lie Leu lie 
20 25 30 

Trp Met lie Lys lie Tyr Phe Leu lie His Ser Thr Ser lie Ala Ala 
35 40 45 

Leu Val Val Thr Arg Lys Asp Ala Lys His Cys Asn Leu Ala Arg Asn 
50 55 60 

Arg Leu His Cys Leu Phe Gin Ser Cys Lys Asn Asn Ser Ser Thr Ser 
65 70 75 80 

Arg Met Gin lie Phe lie Thr lie Leu Ser Cys Glu Asn Phe Lys Ala 
85 90 95 

Glu Ser Lys Glu Lys Leu Lys His Tyr Cys Leu Asn Lys lie Arg Cys 
100 105 HO 

Gly Leu Phe Tyr Phe Leu Asp His Phe Leu Arg Ser lie Met Glu Lys 
115 120 125 

lie Thr Tyr Lys Val Asn Ser Leu Asp Tyr Phe Pro Ser Gin Gin Cys 
130 135 140 

Cys Val Tyr lie His Met Arg Met Ser Gin Arg lie Ser He His Gin 
145 150 155 160 
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Thr Tyr Gin Arg Gin Thr Arg Tyr Lys Thr Gin Glu Lys Val Cys Ser 
165 170 175 

Asn Ser Arg Arg Thr Tyr Cys lie Tyr Tyr Ser Tyr Gly Phe Tyr Tyr 
180 185 190 

Asn Cys Phe Arg Tyr Arg Arg Cys Thr Pro Glu Ser Cys Asp Asn Cys 
195 200 205 

Lys Ser Cys Leu Gin Leu Lys Glu Ser Gin Phe Cys Lys Phe Cys Val 
210 215 220 

Cys His Tyr Phe Val Asn Ser Gin lie Ser Tyr Leu Asn Leu Met Asp 
225 230 235 240 

Ser Tyr Arg Asn Lys Pro Asn Lys Pro Cys Lys Phe Asn Gly lie Tyr 
245 250 255 

Val Lys Ser Phe Gly Thr Asn Ala His Cys lie Tyr lie Gly Phe Leu 
260 265 270 

Lys His Arg Tyr Thr Glu Cys Phe Arg Asp Cys Phe Ser Leu Gin Gin 
275 280 285 

He Thr Cys Phe Asp Tyr Ser Cys Ser Ser Leu He Ser Leu Lys Glu 
290 295 300 

Ala Gly Glu Met Lys Arg Arg Leu Lys Lys Glu He Ser Lys Phe Val 
305 310 315 320 

Asp Ser Ser Val Thr Gly He Asn Asn Lys Asn He Ser Asn Glu Lys 
325 330 335 

Glu Glu Glu Leu Ser Gin Ser Cys Phe Leu Lys He Ser Lys He Pro 
340 345 350 

Gly Lys Arg Asp Thr Phe He Lys lie His He Leu Phe Phe lie Ser 
355 360 365 

Gin Leu Leu Phe Ser Phe He Leu Thr He Phe Phe Asp Leu Glu Val 
370 375 380 

Lys Ser He Lys Glu Lys. Arg Thr Glu Val Thr Leu lie His He His 
385 390 395 400 

Arg Ser Thr Phe lie Tyr Pro He Arg Cys Gly Asn Ser Ser His Pro 
405 410 415 

Phe Lys Cys Tyr Glu Asp He Phe Arg Val Lys Lys Trp Ser Arg Asn 
420 425 430 

Leu Asn Gin Lys Glu Leu Arg Arg Tyr Cys Lys Arg He Glu Leu He 
435 440 445 

Phe Arg Val Leu Pro lie Leu He Asp Cys Arg Asp Arg Gly Asn Cys 
450 455 460 

Thr Glu Asp His Arg Asn Lys Val Thr Phe He Asn Arg He Asn He 
465 470 475 480 

Thr Asn He Glu He Ser Asp Leu Gin Leu Thr Lys Lys Leu Asn Ser 
485 490 495 



9 



Thr lie Lys Asn Thr Asn Leu Gly Gin Asn lie Glu Glu Gly Lys Glu 
500 505 510 

Asp Gin Leu Ala Lys Glu Lys lie Arg Gin lie Lys Cys Val Gin Lys 
515' 520 525 

Cys Arg Asn Lys Arg Phe lie Phe Phe Asn Asn Leu Leu Lys Arg Gly 
530 535 540 

Val Leu Gly Phe Trp Gly Phe Gly 
545 550 

(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 562 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 

Pro Lys Thr Pro Lys Pro Gin Asn Pro Tyr Lys Lys Arg Lys Asn Cys 
1 5 10 15 

Gly Ser Leu Glu lie Lys Tyr Tyr Ser Arg Thr Asn Gly Asp Gly Tyr 
20 25 30 

Cys Phe Gly Cys Tyr Arg Lys Phe Thr Ser Tyr lie Gin Gin Val Gin 
35 40 45 

Leu Leu Gin Glu Arg Met Gin Asn lie Glu lie Trp Leu Glu lie Ala 
50 55 60 

Phe lie Asp Tyr Ser Lys Val Ala Lys Thr lie Arg Val Leu Leu Leu 
65 70 75 80 

Gly Cys Lys Ser Leu Arg Phe Phe Leu Glu Lys lie Ser Phe Lys Lys 
85 90 95 

Arg Arg Ala Lys Ser Arg Asn Cys Asn lie Thr Asn Val lie Lys Ser 
100 105 110 

Gly Asn Glu Asp Tyr Ser He Phe He Thr Ser Gly Ala Leu Trp Arg 
115 120 125 

Lys Leu Leu Asn Thr Lys Arg Thr Val Trp He He Ser Leu Ala Asn 
130 135 140 

Asn Asp Glu Tyr He Lys Phe He Cys Glu Cys Val Lys Gly Ser Arg 
145 150 155 160 

Tyr He Arg Leu Thr Lys Asp Lys Leu Ala He Lys Arg Lys Lys Lys 
165 170 175 

Phe Asp Asn Arg Thr Ala Glu Glu Leu He Ala Phe Thr He Arg Met 
180 185 190 

Gly Phe He Thr lie Val Leu Gly He Asp Gly Glu Leu Pro Ser Leu 
195 200 205 
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Glu Thr lie Glu Lys 
210 

Ser Ser Asp Val Tyr 
225 

lie Ser lie Trp lie 
245 

Leu Met Glu Tyr Thr 
260 

lie Leu Asp Ser Ser 
275 

Ala Tyr Asn Arg Leu 
290 

Leu Lys Lys Gin Ala 
305 

Leu Leu He Leu Leu 
325 

Lys Lys Lys Ser Tyr 
340 

Val Arg Glu He His 
355 

His Ser Cys Tyr Phe 
370 

Lys Val Ser Asn Lys 
385 

Phe He Asp Arg Pro 
405 

Val He Arg Phe Lys 
420 

Arg Asn Gly Ala Glu 
435 

Lys Glu Ser Asn Ser 
450 

Leu He Glu Glu He 
465 

He Lys Leu Leu Leu 
485 

He Phe Asn Cys Arg 
500 

Gin Thr Leu Val Lys 
515 

Lys Lys Gly Asn Lys 
530 



Ala Val Tyr Asn Cys Arg 
215 

Ala He He Leu Cys He 
230 235 

Ala He Glu Thr Asn Gin 
250 

Leu Asn Pro Leu Gly Gin 
265 

He Asp Thr Gin Asn Ala 
280 

Pro Val Leu He Thr Leu 
295 

Lys Cys Lys Glu Asp Arg 
310 315 

Pro Glu Leu Thr Thr Arg 
330 

His Asn Pro Asp Ser Arg 
345 

Ser Leu Lys Phe lie Tyr 
360 

Leu Leu Ser Gin Tyr Phe 
375 

Arg Ser Ala Arg Leu Arg 
390 395 

Ser Tyr He Gin Tyr Asp 
410 

Asn Ser Ala Met Arg Thr 
425 

He Leu He Lys Lys Asn 
440 

Lys Ser Phe Val Asn Lys 
455 

Asp Glu Ala Thr Ala Gin 
470 475 

He Arg Glu Thr Lys Leu 
490 

Asn Lys Ser Cys Thr Lys 
505 

He Leu Arg Lys Glu Lys 
520 

Asn Glu Tyr Arg Ser Glu 
535 




Asn Arg Ser Ser Glu 
220 

Asn Leu Lys Tyr Leu 
240 

He Asn His Ala Ser 
255 

Met His Thr Glu Phe 
270 



Leu Glu Thr Asp Leu 
285 

Ala His Leu Leu Tyr 
300 

Lys Arg Phe Gin Asn 
320 

lie Leu Ala Thr Lys 
335 



Phe Gin Lys Phe Gin 
350 

Tyr Ser Phe Ser Phe 
365 

Leu lie Ser Trp Lys 
380 

Leu Ser Leu Phe Thr 
400 

Asp Lys Glu Thr Ala 
415 

Lys Phe Leu Glu Ser 
430 

Cys Val Asp He Ala 
445 

Tyr Tyr Gin Ser Cys 
460 

Lys He He Lys Glu 
480 

Leu He Arg Ser Ala 
495 

Val Arg Gin Lys He 
510 

Lys Thr Ser Gin Lys 
525 

Glu He Lys Asp Leu 
540 
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Phe Phe Ser lie lie Tyr Cys Lys Glu Gly Phe Trp Gly Phe Gly Val 
545 550 555 560 

Leu Gly 

(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 560 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 

Pro Lys Pro Gin Asn Pro Lys Thr Pro lie Lys Lys Glu Lys lie Glu 
1 5 10 15 

Val Val Lys Asn He He Pro Ala Gin Met Glu Met Asp He Asp Leu 
20 25 . 30 

Asp Asp He Glu Asn Leu Leu Pro Asn Thr Phe Asn Lys Tyr Ser Ser 
35 40 45 

Ser Cys Ser Asp Lys Lys Gly Cys Lys Thr Leu Lys Ser Gly Ser Lys 
50 55 60 

Ser Pro Ser Leu Thr lie Pro Lys Leu Gin Lys Gin Leu Glu Phe Tyr 
65 70 75 80 

Phe Ser Asp Ala Asn Leu Tyr Asn Asp Ser Phe Leu Arg Lys Leu Val 
85 90 95 

Leu Lys Ser Gly Glu Gin Arg Val Glu He Glu Thr Leu Leu Met Phe 
100 105 110 

Lys Asn Gin Val Met Arg He He Leu Phe Phe Arg Ser Leu Leu Lys 
115 120 125 

Glu His Tyr Gly Glu Asn Tyr Leu He Leu Lys Gly Lys Gin Phe Gly 
130 135 140 

Leu Phe Pro Pro Thr Met Met Ser He Leu Asn Ser Tyr Glu Asn Glu 
145 150 155 160 

Ser Lys Asp Leu Asp Thr Ser Asp Leu Pro Lys Thr Asn Ser Leu Asn 
165 170 175 

Ala Arg Lys Ser Leu He He Glu Gin Gin Lys Asn Leu Leu His Leu 
180 185 190 

Leu Phe Val Trp Val Leu Leu Gin Leu Phe Val Ser Thr Val Asn Ser 
195 200 205 

Arg Val Leu Arg Gin Leu Lys Lys Leu Phe Thr Thr Glu Gly He Ala 
210 215 220 

Val Leu Lys Val Leu Met Cys Met Pro Leu Phe Cys Glu Leu He Ser 
225 230 235 240 
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Asn lie Leu Ser 



Val Trp Asn lie 
260 

Trp lie Leu Lys 
275 

Asp Tyr Leu Phe 
290 

Ser Arg Arg Asn 
305 

Cys Phe Phe Cys 



Arg Ala lie Thr 
340 

Arg Tyr lie His 
355 

lie Phe Phe Tyr 
370 

Tyr Gin lie Arg 
385 

Ser lie Asp Leu 



Ser Val Leu Lys 
420 

Glu Pro Lys Ser 
435 

Thr Leu Asn Leu 
450 

Arg Leu Thr Arg 
465 

Tyr Leu Glu Asn 



Asp Glu lie Lys 
500 

Trp Ser Lys Tyr 
515 

Asn Lys Ala lie 
530 

Phe Phe Gin Phe 
545 




Gin Phe Asn Gly 
245 

Arg lie Leu Trp 



Ala lie His Arg 
280 

Cys Leu Leu Leu 
295 

Glu Lys Lys Thr 
310 

Asn Arg Asn Gin 
325 

lie Leu lie Leu 



Asn Ser Tyr lie 
360 

Leu Asn Asn lie 
375 

Glu Ala Leu Asp 
390 

His lie Ser Asn 
405 

lie Val Leu Cys 



Ser Lys Arg lie 
440 

Ser Leu lie Ser 
455 

Gin Leu His Arg 
470 

Lys Leu Asn Tyr 
485 

Ala Glu Leu Lys 



Cys Gly Arg Lys 
520 



Asn Lys Met Ser 
535 

lie Glu Lys Arg 
550 



Leu Lys Gin Thr 
250 

Asp Lys Cys Thr 
265 

Met Leu Arg Leu 



Leu lie Ser Tyr 
300 

Lys Glu Arg Asp 
315 

Gin Glu Tyr Gin 
330 

Lys Asp Phe Lys 
345 

lie Val Phe His 



Phe Cys Leu Ala 
380 

Cys Gly Asn Leu 
395 

Thr Met lie Arg 
410 

Gly Leu Asn Phe 
425 

Ala Ser He Leu 



He Thr Asn Leu 
460 

Arg Ser Leu Lys 
475 

Tyr Arg Asp Gin 
490 

Leu Asp Asn Lys 
505 

Arg Arg Pro Val 



Thr Glu Val Lys 
540 

Gly Phe Gly Val 
555 




Lys Thr Met Gin 
255 

Leu Asn Leu Tyr 
270 

He Leu Thr Thr 
285 

He Phe Lys Arg 



Phe Lys He Cys 
320 

Arg Lys Arg Arg 
335 

Asn Ser Arg Glu 
350 

Phe Thr Ala Val 
365 

Gly Ser Lys Lys 



Ala Tyr Ser His 
400 

Lys Gin Gin Ser 
415 

Ser Gin Glu Met 
430 

Gin Lys Asn Arg 
445 

Asp Cys Leu Lys 



Lys Ser Asn Phe 
480 

Arg Ser Ser He 
495 

Lys Tyr Lys Pro 
510 

Ser Lys Arg Lys 
525 

Lys Lys He Tyr 



Leu Gly Phe Trp 
560 
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(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 719 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 

Met Glu lie Glu Asn Asn Gin Ala Gin Gin Pro Lys Ala Glu Lys Leu 
1 5 10 15 

Trp Trp Glu Leu Glu Leu Glu Met Gin Glu Asn Gin Asn Asp lie Gin 
20 25 30 

Val Arg Val Lys lie Asp Asp Pro Lys Gin Tyr Leu Val Asn Val Thr 
35 40 45 

Ala Ala Cys Leu Leu Gin Glu Gly Ser Tyr Tyr Gin Asp Lys Asp Glu 
50 55 60 

Ara Arq Tyr lie lie Thr Lys Ala Leu Leu Glu Val Ala Glu Ser Asp 
65 70 75 80 

Pro Glu Phe lie Cys Gin Leu Ala Val Tyr lie Arg Asn Glu Leu Tyr 
85 90 95 

lie Arg Thr Thr Thr Asn Tyr lie Val Ala Phe Cys Val Val His Lys 
100 105 110 

Asn Thr Gin Pro Phe lie Glu Lys Tyr Phe Asn Lys Ala Val Leu Leu 
115 120 125 

Pro Asn Asp Leu Leu Glu Val Cys Glu Phe Ala Gin Val Leu Tyr He 
130 135 140 

Phe Asp Ala Thr Glu Phe Lys Asn Leu Tyr Leu Asp Arg He Leu Ser 
145 150 ' 155 160 

Gin Asp He Arg Lys Glu Leu Thr Phe Arg Lys Cys Leu Gin Arg Cys 
165 170 175 

Val Arg Ser Lys Phe Ser Glu Phe Asn Glu Tyr Gin Leu Gly Lys Tyr 
180 185 190 

Cys Thr Glu Ser Gin Arg Lys Lys Thr Met Phe Arg Tyr Leu Ser Val 
195 200 205 

Thr Asn Lys Gin Lys Trp Asp Gin Thr Lys Lys Lys Arg Lys Glu Asn 
210 215 220 

Leu Leu Thr Lys Leu Gin Ala He Lys Glu Ser Glu Asp Lys Ser Lys 
225 230 235 240 

Arg Glu Thr Gly Asp He Met Asn Val Glu Asp Ala He Lys Ala Leu 
245 250 255 

Lys Pro Ala Val Met Lys Lys He Ala Lys Arg Gin Asn Ala Met Lys 
260 265 270 
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Lys His Met Lys Ala Pro Lys lie Pro Asn Ser Thr Leu Glu Ser Lys 
275 280 285 

Tyr Leu Thr Phe Lys Asp Leu lie Lys Phe Cys His lie Ser Glu Pro 
290 295 300 

Lys Glu Arg val Tyr Lys lie Leu Gly Lys Lys Tyr Pro Lys Thr Glu 
305 310 315 320 

Glu Glu Tyr Lys Ala Ala Phe Gly Asp Ser Ala Ser Ala Pro Phe Asn 
325 330 335 

Pro Glu Leu Ala Gly Lys Arg Met Lys lie Glu lie Ser Lys Thr Trp 
340 345 350 

Glu Asn Glu Leu Ser Ala Lys Gly Asn Thr Ala Glu Val Trp Asp Asn 
355 360 365 

Leu lie Ser Ser Asn Gin Leu Pro Tyr Met Ala Met Leu Arg Asn Leu 
370 375 380 

Ser Asn lie Leu Lys Ala Gly Val Ser Asp Thr Thr His Ser lie Val 
385 390 395 400 

lie Asn Lys lie Cys Glu Pro Lys Ala Val Glu Asn Ser Lys Met Phe 
405 410 415 

Pro Leu Gin Phe Phe Ser Ala lie Glu Ala Val Asn Glu Ala Val Thr 
420 425 430 

Lys Gly Phe Lys Ala Lys Lys Arg Glu Asn Met Asn Leu Lys Gly Gin 
435 440 445 

lie Glu Ala Val Lys Glu Val Val Glu Lys Thr Asp Glu Glu Lys Lys 
450 455 460 

Asp Met Glu Leu Glu Gin Thr Glu Glu Gly Glu Phe Val Lys Val Asn 
465 470 475 480 

Glu Gly lie Gly Lys Gin Tyr lie Asn Ser lie Glu Leu Ala He Lys 
485 490 495 

He Ala Val Asn Lys Asn Leu Asp Glu He Lys Gly His Thr Ala lie 
500 505 510 

Phe Ser Asp Val Ser Gly Ser Met Ser Thr Ser Met Ser Gly Gly Ala 
515 520 525 

Lys Lys Tyr Gly Ser Val Arg Thr Cys Leu Glu Cys Ala Leu Val Leu 
530 535 540 

Gly Leu Met Val Lys Gin Arg Cys Glu Lys Ser Ser Phe Tyr He Phe 
545 550 555 560 

Ser Ser Pro Ser Ser Gin Cys Asn Lys Cys Tyr Leu Glu Val Asp Leu 
565 570 575 

Pro Gly Asp Glu Leu Arg Pro Ser Met Gin Lys Leu Leu Gin Glu Lys 
580 585 590 

Gly Lys Leu Gly Gly Gly Thr Asp Phe Pro Tyr Glu Cys He Asp Glu 
595 600 605 
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Trp Thr Lys Asn Lys Thr His Val Asp Asn lie Val lie Leu Ser Asp 
610 615 620 

Met Met lie Ala Glu Gly Tyr Ser Asp lie Asn Val Arg Gly Ser Ser 
625 630 635 640 

lie Val Asn Ser lie Lys Lys Tyr Lys Asp Glu Val Asn Pro Asn lie 
645 650 655 

Lys lie Phe Ala Val Asp Leu Glu Gly Tyr Gly Lys Cys Leu Asn Leu 
660 665 670 

Gly Asp Glu Phe Asn Glu Asn Asn Tyr He Lys He Phe Gly Met Ser 
675 680 685 

Asp Ser He Leu Lys Phe He Ser Ala Lys Gin Gly Gly Ala Asn Met 
690 695 700 

Val Glu Val He Lys Asn Phe Ala Leu Gin Lys lie Gly Gin Lys 
705 710 715 

(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 872 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 

Met Ser Arg Arg Asn Gin Lys Lys Pro Gin Ala Pro He Gly Asn Glu 
1 5 10 15 

Thr Asn Leu Asp Phe Val Leu Gin Asn Leu Glu Val Tyr Lys Ser Gin 
20 25 30 

He Glu His Tyr Lys Thr Gin Gin Gin Gin lie Lys Glu Glu Asp Leu 
35 40 45 

Lys Leu Leu Lys Phe Lys Asn Gin Asp Gin Asp Gly Asn Ser Gly Asn 
50 55 60 

Asp Asp Asp Asp Glu Glu Asn Asn Ser Asn Lys Gin Gin Glu Leu Leu 
65 70 75 80 

Arg Arg Val Asn Gin He Lys Gin Gin Val Gin Leu He Lys Lys Val 
85 90 95 

Gly Ser Lys Val Glu Lys Asp Leu Asn Leu Asn Glu Asp Glu Asn Lys 
100 105 110 

Lys Asn Gly Leu Ser Glu Gin Gin Val Lys Glu Glu Gin Leu Arg Thr 
115 120 125 

He Thr Glu Glu Gin Val Lys Tyr Gin Asn Leu Val Phe Asn Met Asp 
130 135 140 

Tyr Gin Leu Asp Leu Asn Glu Ser Gly Gly His Arg Arg His Arg Arg 
145 150 155 160 
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Glu Thr Asp Tyr Asp Thr Glu Lys Trp Phe Glu He Ser His Asp Gin 
165 170 175 

Lys Asn Tyr Val Ser He Tyr Ala Asn Gin Lys Thr Ser Tyr Cys Trp 
180 185 190 

Trp Leu Lys Asp Tyr Phe Asn Lys Asn Asn Tyr Asp His Leu Asn Val 
195 200 205 

Ser He Asn Arg Leu Glu Thr Glu Ala Glu Phe Tyr Ala Phe Asp Asp 
210 215 220 

Phe Ser Gin Thr He Lys Leu Thr Asn Asn Ser Tyr Gin Thr Val Asn 
225 230 235 240 

He Asp Val Asn Phe Asp Asn Asn Leu Cys He Leu Ala Leu Leu Arg 
245 250 255 

Phe Leu Leu Ser Leu Glu Arg Phe Asn He Leu Asn He Arg Ser Ser 
260 265 270 

Tyr Thr Arg Asn Gin Tyr Asn Phe Glu Lys He Gly Glu Leu Leu Glu 
275 280 285 

Thr He Phe Ala Val Val Phe Ser His Arg His Leu Gin Gly He His 
290 295 300 

Leu Gin Val Pro Cys Glu Ala Phe ■ Gin Tyr Leu Val Asn Ser Ser Ser 
305 310 315 320 

Gin He Ser Val Lys Asp Ser Gin Leu Gin Val Tyr Ser Phe Ser Thr 
325 330 335 

Asp Leu Lys Leu Val Asp Thr Asn Lys Val Gin Asp Tyr Phe Lys Phe 
340 345 350 

Leu Gin Glu Phe Pro Arg Leu Thr His Val Ser Gin Gin Ala lie Pro 
355 360 365 

Val Ser Ala Thr Asn Ala Val Glu Asn Leu Asn Val Leu Leu Lys Lys 
370 375 380 

Val Lys His Ala Asn Leu Asn Leu Val Ser He Pro Thr Gin Phe Asn 
385 390 395 400 

Phe Asp Phe Tyr Phe Val Asn Leu Gin His Leu Lys Leu Glu Phe Gly 
405 410 415 

Leu Glu Pro Asn He Leu Thr Lys Gin Lys Leu Glu Asn Leu Leu Leu 
420 425 430 

Ser He Lys Gin Ser Lys Asn Leu Lys Phe Leu Arg Leu Asn Phe Tyr 
435 440 445 

Thr Tyr Val Ala Gin Glu Thr Ser Arg Lys Gin He Leu Lys Gin Ala 
450 455 460 

Thr Thr He Lys Asn Leu Lys Asn Asn Lys Asn Gin Glu Glu Thr Pro 
465 470 475 480 

Glu Thr Lys Asp Glu Thr Pro Ser Glu Ser Thr Ser Gly Met Lys Phe 
485 490 495 
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Phe Asp His Leu Ser Glu Leu Thr Glu Leu Glu Asp Phe Ser Val Asn 
500 505 510 

Leu Gin Ala Thr Gin Glu lie Tyr Asp Ser Leu His Lys Leu Leu lie 
515 520 525 

Arg Ser Thr Asn Leu Lys Lys Phe Lys Leu Ser Tyr Lys Tyr Glu Met 
530 535 540 

Glu Lys Ser Lys Met Asp Thr Phe lie Asp Leu Lys Asn lie Tyr Glu 
545 550 555 560 

Thr Leu Asn Asn Leu Lys Arg Cys Ser Val Asn lie Ser Asn Pro His 
565 570 575 

Gly Asn lie Ser Tyr Glu Leu Thr Asn Lys Asp Ser Thr Phe Tyr Lys 
580 585 590 

Phe Lys Leu Thr Leu Asn Gin Glu Leu Gin His Ala Lys Tyr Thr Phe 
595 600 605 

Lys Gin Asn Glu Phe Gin Phe Asn Asn Val Lys Ser Ala Lys He Glu 
610 615 620 

Ser Ser Ser Leu Glu Ser Leu Glu Asp He Asp Ser Leu Cys Lys Ser 
625 630 635 ' 640 

He Ala Ser Cys Lys Asn Leu Gin Asn Val Asn He He Ala Ser Leu 
645 650 655 

Leu Tyr Pro Asn Asn He Gin Lys Asn Pro Phe Asn Lys Pro Asn Leu 
660 665 670 

Leu Phe Phe Lys Gin Phe Glu Gin Leu Lys Asn Leu Glu Asn Val Ser 
675 680 685 

He Asn Cys He Leu Asp Gin His He Leu Asn Ser He Ser Glu Phe 
690 695 700 

Leu Glu Lys Asn Lys Lys He Lys Ala Phe He Leu Lys Arg Tyr Tyr 
705 710 715 720 

Leu Leu Gin Tyr Tyr Leu Asp Tyr Thr Lys Leu Phe Lys Thr Leu Gin 
725 730 735 

Gin Leu Pro Glu Leu Asn Gin Val Tyr He Asn Gin Gin Leu Glu Glu 
740 745 750 

Leu Thr Val Ser Glu Val His Lys Gin Val Trp Glu Asn His Lys Gin 
755 760 765 

Lys Ala Phe Tyr Glu Pro Leu Cys Glu Phe He Lys Glu Ser Ser Gin 
770 775 780 

Thr Leu Gin Leu lie Asp Phe Asp Gin Asn Thr Val Ser Asp Asp Ser 
785 790 795 800 

lie Lys Lys He Leu Glu Ser He Ser Glu Ser Lys Tyr His His Tyr 
805 810 815 

Leu Arg Leu Asn Pro Ser Gin Ser Ser Ser Leu He Lys Ser Glu Asn 
820 825 830 
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Glu Glu lie Gin Glu Leu Leu Lys Ala Cys Asp Glu Lys Gly Val Leu 
835 840 845 

Val Lys Ala Tyr Tyr Lys Phe Pro Leu Cys Leu Pro Thr Gly Thr Tyr 
850 855 860 

Tyr Asp Tyr Asn Ser Asp Arg Trp 
865 870 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 83 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 

Asp He Asp Leu Asp Asp He Glu Asn Leu Leu Pro Asn Thr Phe Asn 
1 5 10 15 

Lys Tyr Ser Ser Ser Cys Ser Asp Lys Lys Gly Cys Lys Thr Leu Lys 
20 25 30 

Ser Gly Ser Lys Ser Pro Ser Leu Thr. He Pro Lys Leu Gin Lys Gin 
35 40 45 

Leu Glu Phe Tyr Phe Ser Asp Ala Asn Leu Tyr Asn Asp Ser Phe Leu 
50 55 60 

Arg Lys Leu Val Leu Lys Ser Gly Glu Gin Arg Val Glu He Glu Thr 
65 70 75 80 

Leu Leu Met 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 100 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10 : 

Asn Val Lys Ser Ala Lys He Glu Ser Ser Ser Leu Glu Ser Leu Glu 
1 5 10 15 

Asp He Asp Ser Leu Cys Lys Ser lie Ala Ser Cys Lys Asn Leu Gin 
20 25 30 

Asn Val- Asn He He Ala Ser Leu Leu Tyr Pro Asn Asn He Gin Lys 
35 40 45 

Asn Pro Phe Asn Lys Pro Asn Leu Leu Phe Phe Lys Gin Phe Glu Gin 
50 55 60 
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Leu Lys Asn Leu Glu Asn Val Ser lie Asn Cys lie Leu Asp Gin His 
65 70 75 80 

lie Leu Asn Ser lie Ser Glu Phe Leu Glu Lys Asn Lys Lys lie Lys 
85 90 95 

Ala Phe lie Leu 
100 

INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 85 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Met Glu Met Asp lie Asp Leu Asp Asp lie Glu Asn Leu Leu Pro Asn 
1 5 10 15 

Thr Phe Asn Lys Tyr Ser Ser Ser Cys Ser Asp Lys Lys Gly Cys Lys 
20 25 30 

Thr Leu Lys Ser Gly Ser Lys Ser Pro Ser Leu Thr lie Pro Lys Leu 
35 40 45 

Gin Lys Gin Leu Glu Phe Tyr Phe Ser Asp Ala Asn Leu Tyr Asn Asp 
50 55 60 

Ser Phe Leu Arg Lys Leu Val Leu Lys Ser Gly Glu Gin Arg Val Glu 
65 70 75 80 

lie Glu Thr Leu Leu 
85 

INFORMATION FOR SEQ ID NO : 12 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 98 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE : peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

lie Glu Leu Ala lie Lys lie Ala Val Asn Lys Asn Leu Asp Glu lie 
1 5 10 15 

Lys Gly His Thr Ala He Phe Ser Asp Val Ser Gly Ser Met Ser Thr 
2 0 25 3 0 

Ser Met Ser Gly Gly Ala Lys Lys Tyr Gly Ser Val Arg Thr Cys Leu 
35 40 45 

Glu Cys Ala Leu Val Leu Gly Leu Met Val Lys Gin Arg Cys Glu Lys 
50 55 60 

20 



Ser Ser Phe Tyr lie Phe Ser Ser Pro Ser Ser Gin Cys Lys Cys Tyr 
65 70 75 80 

Leu Glu Val Asp Leu Pro Gly Asp Glu Leu Arg Pro Ser Met Gin Lys 
85 90 95 

Leu Leu 

(2) INFORMATION FOR SEQ ID NO : 13 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 69 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Gly Gin Pro Lys Leu Phe Phe Ala Thr Met Asp lie Glu Lys Cys Tyr 
1 5 10 15 

Asp Ser Val Asn Arg Glu Lys Leu Ser Thr Phe Leu Lys Thr Thr Lys 
20 25 30 

Leu Leu Lys Phe Tyr Lys Gin Thr Lys Gly lie Pro Gin Gly Leu Cys 
35 40 45 

Val Ser Ser lie Leu Ser Ser Phe Tyr Tyr Ala Thr Leu Glu Glu Ser 
50 55 60 

Ser Leu Gly Phe Leu 
65 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 69 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 14 : 

Lys Asn Arg Asn Leu His Cys Thr Tyr lie Asp Tyr Lys Lys Ala Phe 
1 5 10 15 

Asp Ser lie Pro His Ser Trp Leu lie Gin Val Leu Glu lie Tyr Lys 
20 25 30 

lie Asn Arg Gin lie Ala lie Lys Lys Gly He Tyr Gin Gly Asp Ser 
35 40 45 

Leu Ser Pro Leu Trp Phe Cys Leu Ala Leu Asn Pro Leu Ser His Gin 
50 55 60 

Leu His Asn Asp Arg 
65 
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(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 69 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:15: 

Phe Gly Gly Ser Asn Trp Phe Arg Glu Val Asp Leu Lys Lys Cys Phe 
1 5 10 15 

Asp Thr He Ser His Asp Leu He He Lys Glu Leu Lys Arg Tyr He 
20 25 30 

Ser Asp His Val Pro Val Gly Pro Arg Val Cys Val Gin Gly Ala Pro 
35 40 45 

Thr Ser Pro Ala Leu Cys Asn Ala Val Leu Leu Arg Leu Asp Arg Arg 
50 55 60 

Leu Ala Gly Leu Ala 
65 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 69 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Leu Lys Lys Lys Lys Ser Val Thr Val Leu Asp Val Gly Asp Ala Tyr 
1 5 10 15 

Phe Ser Val Pro Leu Asp Glu Asp Phe Arg Lys Tyr Thr Ala Phe Thr 
20 25 30 

He Pro Gly He Arg Tyr Gin Tyr Asn Val Leu Pro Gin Gly Trp Lys 
35 40 45 

Gly Ser Pro Ala He Phe Gin Ser Ser Met Thr Lys He Leu Glu Pro 
50 55 60 

Phe Arg Lys Gin Asn 
65 
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(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 69 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

Val Leu Pro Glu Leu Tyr Phe Met Lys Phe Asp Val Lys Ser Cys Tyr 
1 5 10 15 

Asp Ser lie Pro Arg Met Glu Cys Met Arg lie Leu Lys Asp Ala Leu 
20 25 30 

Lys Asn Lys Cys Tyr lie Arg Glu Asp Gly Leu Phe Gin Gly Ser Ser 
35 40 45 

Leu Ser Ala Pro lie Val Asp Leu Val Tyr Asp Asp Leu Leu Glu Phe 
50 55 60 

Tyr Ser Glu Phe Lys 
65 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 

■ (xi) SEQUENCE DESCRIPTION: SEQ ID NO:18: 

Leu Met Arg Leu Thr Asp Asp Tyr Leu Leu lie Thr Thr Gin Glu Asn 
1 5 10 15 

Asn Ala Val Leu Phe lie Glu Lys Leu lie Asn Val Ser Arg Glu Asn 
20 25 30 

Gly Phe Lys Phe Asn Met Lys Lys Leu Gin Thr- Gin Asp Tyr Cys Asp 
35 40 45 

Trp lie Gly lie Ser lie 
50 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 
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(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 19: 

His Leu lie Tyr Met Asp Asp lie Lys Leu Tyr Ala Lys Asn Asp Lys 
1 5 10 15 

Glu Met Lys Lys Leu lie Asp Thr Thr Thr lie Phe Ser Asn Asp lie 
20 25 30 

Ser Met Gin Phe Gly Leu Asp Lys Cys Lys Thr Lys Cys Leu Tyr Lys 
35 40 45 

Tyr Leu Gly Phe Gin Gin 
50 

INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20: 

Tyr Val Arg Tyr Ala Asp Asp lie Leu lie Gly Val Leu Gly Ser Lys 
1 5 10 15 

Asn Lys lie lie Lys Arg Asp Leu Asn Asn Phe Leu Asn Ser Leu Gly 
20 25 30 

Leu Thr lie Asn Glu Glu Lys Thr Leu lie Glu Thr Pro Ala Arg Phe 
35 40 45 

Leu Gly Tyr Asn lie 
50 

INFORMATION FOR SEQ ID NO : 2 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

lie Tyr Gin Tyr Met Asp Asp Leu Tyr Val Gly Ser His Leu Glu lie 
1 5 10 15 

Gly His Arg Thr Lys lie Glu Glu Leu Arg Gin His Leu Leu Arg Trp 
20 25 30 

Gly Leu Thr Thr Pro Asp Lys Lys His Gin Lys Glu Pro Pro Phe Leu 
35 40 45 

Trp Met Gly Tyr Glu Leu 
50 
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(2) INFORMATION FOR SEQ ID NO: 22; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 49 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 

lie Leu Lys Leu Ala Asp Asp Phe Leu lie lie Ser Thr Asp Gin Gin 
1 5 10 15 

Gin Val He Asn He Lys Lys Leu Ala Met Gly Gly Phe Gin Lys Tyr 
20 25 30 

Asn Ala Lys Ala Asn Arg He Arg Ser Lys Ser Ser Lys Gly He Phe 
35 40 45 

Arg 

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 

Leu Gin Lys Gin Leu Glu Phe Tyr Phe Ser Asp Ala Asn Leu Tyr Asn 
1 5 10 15 

Asp Ser Phe Leu Arg Lys Leu Val Leu Lys Ser Gly Glu Gin Arg Val 
20 25 30 

Glu lie Glu Thr Leu Leu Met 
35 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO;24; 

He Cys His Gin Glu Tyr Tyr Phe Gly Asp Phe Asn Leu Pro Arg Asp 
1 5 10 15 

Lys Phe Leu Lys Glu Gin He Lys Leu Asp Glu Gly Trp Val Pro Leu 
20 25 30 

Glu He Met He Lys 
35 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

He Cys Glu Gin He Glu Tyr Tyr Phe Gly Asp His Asn Leu Pro Arg 
1 5 10 15 

Asp Lys Phe Leu Lys Gin Gin He Leu Leu Asp Asp Gly Trp Val Pro 
20 25 30 

Leu Glu Thr Met He Lys 
35 

(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 
CD) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:26: 

He Leu Arg Gin Val Glu Tyr Tyr Phe Gly Asp Ala Asn Leu Asn Arg 
15 10 15 

Asp Lys Phe Leu Arg Glu Gin He Gly Lys Asn Glu Asp Gly Trp Val 
20 25 30 

Pro Leu Ser Val Leu Val Thr 
35 

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 
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(xi) 



SEQUENCE DESCRIPTION: SEQ ID NO: 27: 



Cys 
1 



Leu Lys Gin Val Glu Phe Tyr Phe Ser Glu Phe Asn Phe Pro Tyr 
5 10 15 



Asp 



Arg Phe Leu Arg Thr Thr Ala Glu Lys Asn Asp Gly Trp Val Pro 
20 25 30 



He 



Ser Thr He Ala Thr 
35 



(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

TAGAC CTGTT AGTGTACATT TGAATTGAAG C 31 

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

TAGAC CTGTT AGGTTGGATT TGTGGCATCA 3 0 

(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

CAAAACCCCA AAACCTAACA GGTCTA 26 
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(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 103 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

GCGGGAATTC TAATACGACT CACTATAGGG AAGAAACTCT GATGAGGCCG AAAGGCCGAA 60 

ACTCCACGAA AGTGGAGTAA GTTTCTCGAT AATTGATCTG TAG 103 

(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION : . /desc = "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

CGGGGATCCT CTTCAAAAGA TGAGAGGACA GCAAAC 36 

(2 ) INFORMATION FOR SEQ ID NO :33s 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc » "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:33: 

CCCCAAAACC CCAAAACCCC AAAACCCCCA CAGGGGTTTT GGGGTTTTGG GGTTTTGGGG 6 0 

(2) INFORMATION FOR SEQ ID NO: 34; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 58 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

CCAAAACCCC AAAACCCCAA AACCCCCACA GGGGTTTTGG GGTTTTGGGG TTTTGGGG 58 
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(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 56 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc * "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 35: 

AAAACCCCAA AACCCCAAAA CCCCCACAGG GGTTTTGGGG TTTTGGGGTT TTGGGG 56 

(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc - "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

AACCCCAAAA CCCCAAAACC CCCACAGGGG TTTTGGGGTT TTGGGGTTTT GGGG 54 

(2) INFORMATION FOR SEQ ID NO: 37 : 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 48 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS • single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc » "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

CCCCAAAACC CCAAAACCCC CACAGGGGTT TTGGGGTTTT GGGGTTTT 4 8 

(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 52 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:38: 

AAAACCCCAA AACCCCAAAA CCCCCACAGG GGTTTTGGGG TTTTGGGGTT TT 52 
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(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY; linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

AACCCCAAAA CCCCAAAACC CCCACAGGGG TTTTGGGGTT TTGGGGTTTT 50 

(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:40: 

CCCCAAAACC CCAAAACCCC CACAGGGGTT TTGGGGTTTT GGGGTTTT 4 8 

(2) INFORMATION FOR SEQ ID NO; 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:41: 

CCAAAACCCC. AAAACCCCCA CAGGGGTTTT GGGGTTTTGG GGTTTT 46 

(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:42 : 
AAAACCCCAA AACCCCCACA GGGGTTTTGG GGTTTTGGGG TTTT 44 
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(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "RNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43 

CAAAACCCCA AAACC 

(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44 

TTTTGGGG 

(2) INFORMATION FOR SEQ ID NO : 45 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = . " RNA " 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45 

CAAAACCCCA AAACC 

(2) INFORMATION FOR SEQ ID NO:46: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46 



GGGGTTTT 



(2) INFORMATION FOR SEQ ID NO: 47: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:47: 

TCTRAARTAR TGDGTNADRT TRTTCAT 

(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc « "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:48.: 

GCGGATC CAT GAAYCCWGAR AAYCCWAAYG T 

(2) INFORMATION FOR SEQ ID NO:49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc « "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49 

NNNGTNACHG GHATHAAYAA 

(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

. (ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc =r "DNA" 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50 

DGCDGTYTCY TGRTCRTTRT A 
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(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2421 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 

AACT CATTT A ATTACTAATT TAATCAACAA GATTGATAAA AAGCAGTAAA TAAAACCCAA 60 

TAGATTTAAT TTAGAAAGTA TCAATTGAAA AATGGAAATT GAAAACAACT AAGCACAATA 120 

GCCAAAAGCC GAAAAATTGT GGTGGGAACT TGAATTAGAG ATGCAAGAAA AC C AAAATGA 18 0 

TATATAAGTT AGGGTTAAGA TTGACGATCC TAAGCAATAT CTCGTGAACG TCACTGCAGC 240 

ATGTTTGTTG TAGGAAGGTA GTTACTACTA AGATAAAGAT GAAAGAAGAT ATATCATCAC 300 

TAAAGCACTT CTTGAGGTGG CTGAGTCTGA TCCTGAGTTC ATCTGCTAGT TGGCAGTCTA 36 0 

CAT C CGTAAT GAACTTTACA TCAGAACTAC CACTAACTAC ATTGTAGCAT TTTGTGTTGT 42 0 

CCACAAGAAT ACTCAACCAT TCATCGAAAA GTACTTCAAC AAAGCAGTAC TTTTGCCTAA 48 0 

TGACTTACTG GAAGT CTGTG AATTTGCATA GGTTCTCTAT ATTTTTGATG CAACTGAATT 540 

CAAAAATTTG TATCTTGATA GGATACTTTC ATAAGATATT CGTAAGGAAC TCACTTTCCG 600 

TAAGTGTTTA CAAAGATGCG TCAGAAGCAA GTTTTCTGAA TTCAACGAAT ACTAACTTGG 660 

TAAGTATTGC ACTGAATCCT AACGTAAGAA AACAATGTTC CGTTACCTCT CAGTTACCAA 72 0 

CAAGTAAAAG TGGGATTAAA CTAAGAAGAA GAGAAAAGAG AATCTCTTAA C C AAACTTT A 78 0 

GGCAATAAAG GAATCTGAAG ATAAGTCCAA GAGAGAAACT GGAGACATAA TGAACGTTGA 84 0 

AGATGCAATC AAGGCTTTAA AACCAGCAGT TATGAAGAAA ATAGC CAAGA GATAGAATGC 900 

CATGAAGAAA CACATGAAGG CACCTAAAAT TCCTAACTCT ACCTTGGAAT CAAAGTACTT 960 

GAC CTTCAAG GATCTCATTA AGTTCTGCCA TATTTCTGAG CCTAAAGAAA GAGTCTATAA 1020 

GATCCTTGGT AAAAAATACC CTAAGACCGA AGAGGAATAC AAAGCAGCCT TTGGTGATTC 1080 

TGCATCTGCA CCCTTCAATC CTGAATTGGC TGGAAAGCGT ATGAAGATTG AAATCTCTAA 1140 

AACATGGGAA AATGAACTCA GTGCAAAAGG CAACACTGCT GAGGTTTGGG ATAATTTAAT 1200 

TTCAAGCAAT TAACTCCCAT ATATGGC CAT GTTACGTAAC TTGTCTAACA TCTTAAAAGC 1260 

CGGTGTTTCA GATACTACAC ACTCTATTGT GATCAACAAG ATTTGTGAGC CCAAGGCCGT 132 0 

TGAGAACTCC AAGATGTTCC CTCTTCAATT CTTTAGTGCC ATTGAAGCTG TTAATGAAGC 13 8 0 

AGTTACTAAG GGATTCAAGG CCAAGAAGAG AGAAAATATG AATCTTAAAG GTCAAATCGA 1440 

AGCAGTAAAG GAAGTTGTTG AAAAAACCGA TGAAGAGAAG AAAGATATGG AGTTGGAGTA 15 0 0 
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AACCGAAGAA 


GGAGAATTTG 


TTAAAGTCAA 


CGAAGGAATT 


GGCAAGCAAT 


ACATTAACTC 


1560 


CATTGAACTT 


GCAATCAAGA 


TAGCAGTTAA 


CAAGAATTTA 


GAT.GAAATCA 


AAGGACACAC 


1620 


TGCAATCTTC 


TCTGATGTTT 


CTGGTTCTAT 


GAGTACCTCA 


ATGTCAGGTG 


GAGCCAAGAA 


1680 


GTATGGTTCC 


GTTCGTACTT 


GTCTCGAGTG 


TGCATTAGTC 


CTTGGTTTGA 


TGGTAAAATA 


1740 


ACGTTGTGAA 


AAGTCCTCAT 


TCTACATCTT 


CAGTTCACCT 


AGTTCTCAAT 


GCAATAAGTG 


1800 


TTACTTAGAA 


GTTGATCTCC 


CTGGAGACGA 


ACTCCGTCCT 


TCTATGTAAA 


AACTTTTGCA 


1860 


AGAGAAAGGA 


AAACTTGGTG 


GTGGTACTGA 


TTTCCCCTAT 


GAGTGCATTG 


ATGAATGGAC 


1920 


AAAGAATAAA 


ACTCACGTAG 


ACAATATCGT 


TATTTTGTCT 


GATATGATGA 


TTGCAGAAGG 


1980 


ATATTCAGAT 


ATCAATGTTA 


GAGGCAGTTC 


CATTGTTAAC 


AGCATCAAAA 


AGTACAAGGA 


2040 


TGAAGTAAAT 


CCTAACATTA 


AAATCTTTGC 


AGTTGACTTA 


GAAGGTTACG 


GAAAGTGCCT 


2100 


TAATCTAGGT 


GATGAGTTCA 


ATGAAAACAA 


CTACATCAAG 


ATATTCGGTA 


TGAGCGATTC 


2160 


AATCTTAAAG 


TTCATTTCAG 


CCAAGCAAGG 


AGGAGCAAAT 


ATGGT CGAAG 


TTATCAAAAA 


2220 


CTTTGCCCTT 


CAAAAAATAG 


GACAAAAGTG 


AGTTTCTTGA 


GATTCTTCTA 


TAACAAAAAT 


2280 


CTCACCCCAC 


TTTTTTGTTT 


TATTGCATAG 


CCATTATGAA 


ATTTAAATTA 


TTATCTATTT 


2340 


ATTTAAGTTA 


CTTACATAGT 


TTATGTATCG 


CAGTCTATTA 


GCCTATTCAA 


ATGATTCTGC 


2400 


AAAGAACAAA 


AAAGATTAAA 


A 
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(2) INFORMATION FOR SEQ ID NO: 52 











(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 699 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:52: 

Glu Leu Glu Leu Glu Met Gin Glu Asn Gin Asn Asp lie Gin Val Arg 
1 5 10 15 

Val Lys lie Asp Asp Pro Lys Gin Tyr Leu Val Asn Val Thr Ala Ala 
20 25 30 

Cys Leu Leu Gin Glu Gly Ser Tyr Tyr Gin Asp Lys Asp Glu Arg Arg 
35 40 45 

Tyr lie lie Thr Lys Ala Leu Leu Glu Val Ala Glu Ser Asp Pro Glu 
50 55 60 

Phe lie Cys Gin Leu Ala Val Tyr He Arg Asn Glu Leu Tyr He Arg 
65 70 75 80 

Thr Thr Thr Asn Tyr He Val Ala Phe Cys Val Val His Lys Asn Thr 
85 90 95 
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Gin Pro Phe lie Glu Lys Tyr Phe Asn Lys Ala Val Leu Leu Pro Asn 
100 105 110 

Asp Leu Leu Glu Val Cys Glu Phe Ala Gin Val Leu Tyr lie Phe Asp 
115 120 125 

Ala Thr Glu Phe Lys Asn Leu Tyr Leu Asp Arg lie Leu Ser Gin Asp 
130 135 140 

lie Arg Lys Glu Leu Thr Phe Arg Lys Cys Leu Gin Arg Cys Val Arg 
145 150 155 160 

Ser Lys Phe Ser Glu Phe Asn Glu Tyr Gin Leu Gly Lys Tyr Cys Thr 
165 170 175 

Glu Ser Gin Arg Lys Lys Thr Met Phe Arg Tyr Leu Ser Val Thr Asn 
180 .185 190 

Lys Gin Lys Trp Asp Gin Thr Lys Lys Lys Arg Lys Glu Asn Leu Leu 
195 200 205 

Thr Lys Leu Gin Ala lie Lys Glu Ser Glu Asp Lys Ser Lys Arg Glu 
210 215 220 

Thr Gly Asp lie Met Asn Val Glu Asp Ala lie Lys Ala Leu Lys Pro 
225 230 235 240 

Ala Val Met Lys Lys lie Ala Lys Arg Gin Asn Ala Met Lys Lys His 
245 250 255 

Met Lys Ala Pro Lys lie Pro Asn Ser Thr Leu Glu Ser Lys Tyr Leu 
260 265 270 

Thr Phe Lys Asp Leu lie Lys Phe Cys His lie Ser Glu Pro Lys Glu 
275 280 285 

Arg Val Tyr Lys lie Leu Gly Lys Lys Tyr Pro Lys Thr Glu Glu Glu 
290 295 300 

Tyr Lys Ala Ala Phe Gly Asp Ser Ala Ser Ala Pro Phe Asn Pro Glu 
305 310 . 315 320 

Leu Ala Gly Lys Arg Met Lys lie Glu lie Ser Lys Thr Trp Glu Asn 
325 330 335 

Glu Leu Ser Ala Lys Gly Asn Thr Ala Glu Val Trp Asp Asn Leu lie 
340 345 350 

Ser Ser Asn Gin Leu Pro Tyr Met Ala Met Leu Arg Asn Leu Ser Asn 
355 360 365 

lie Leu Lys Ala Gly Val Ser Asp Thr Thr His Ser lie Val lie Asn 
370 375 380 

Lys He Cys Glu Pro Lys Ala Val Glu Asn Ser Lys Met Phe Pro Leu 
385 390 395 400 
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Gin Phe Phe Ser Ala He Glu Ala Val Asn Glu Ala Val Thr Lys Gly 
405 410 415 

Phe Lys Ala Lys Lys Arg Glu Asn Met Asn Leu Lys Gly Gin He Glu 
420 425 430 

Ala Val Lys Glu Val Val Glu Lys Thr Asp Glu Glu Lys Lys Asp Met 
435 440 445 

Glu Leu Glu Gin Thr Glu Glu Gly Glu Phe Val Lys Val Asn Glu Gly 
450 455 460 

He Gly Lys Gin Tyr He Asn Ser He Glu Leu Ala lie Lys He Ala 
465 470 475 480 

Val Asn Lys Asn Leu Asp Glu lie Lys Gly His Thr Ala He Phe Ser 
485 490 495 

Asp Val Ser Gly Ser Met Ser Thr Ser Met Ser Gly Gly Ala Lys Lys 
500 505 510 

Tyr Gly Ser Val Arg Thr Cys Leu Glu Cys Ala Leu Val Leu Gly Leu 
515 520 525 

Met Val Lys Gin Arg Cys Glu Lys Ser Ser Phe Tyr He Phe Ser Ser 
530 535 540 

Pro Ser Ser Gin Cys Asn Lys Cys Tyr Leu Glu Val Asp Leu Pro Gly 
545 550 555 560 

Asp Glu Leu Arg Pro Ser Met Gin Lys Leu Leu Gin Glu Lys Gly Lys 
565 570 575 

Leu Gly Gly Gly Thr Asp Phe Pro Tyr Glu Cys He Asp Glu Trp Thr 
580 585 590 

Lys Asn Lys Thr His Val Asp Asn He Val He Leu Ser Asp Met Met 
595 600 605 

He Ala Glu Gly Tyr Ser Asp He Asn Val Arg Gly Ser Ser He Val 
610 615 620 

Asn Ser He Lys Lys Tyr Lys Asp Glu Val Asn Pro Asn lie Lys He 
625 630 635 640 

Phe Ala Val Asp Leu Glu Gly Tyr Gly Lys Cys Leu Asn Leu Gly Asp 
645 650 655 

Glu Phe Asn Glu Asn Asn Tyr He Lys lie Phe Gly Met Ser Asp Ser 
660 665 670 

He Leu Lys Phe He Ser Ala Lys Gin Gly Gly Ala Asn Met Val Glu 
675 680 685 

Val He Lys Asn Phe Ala Leu Gin Lys He Gly 
690 695 

(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2829 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:53: 



TCAATACTAT 


TAATTAATAA 


ATAAAAAAAA 


GCAAACTACA 


AAGAAAATGT 


CAAGGCGTAA 


60 


CTAAAAAAAG 


CCATAGGCTC 


CTATAGGCAA 


TGAAACAAAT 


CTTGATTTTG 


TATTACAAAA 


120 


TCTAGAAGTT 


TACAAAAGCC 


AGATTGAGCA 


TTATAAGACC 


TAGTAGTAAT 


AGATCAAAGA 


180 


GGAGGATCTC 


AAGCTTTTAA 


AGTTCAAAAA 


TTAAGATTAG 


GATGGAAACT 


CTGGCAACGA 


240 


TGATGATGAT 


GAAGAAAACA 


ACTCAAATAA 


ATAATAAGAA 


TTATTAAGGA 


GAGTCAATTA 


300 


GATTAAGTAG 


CAAGTTTAAT 


TGATAAAAAA 


AGTTGGTTCT 


AAGGTAGAGA 


AAGATTTGAA 


360 


TTTGAACGAA 


GATGAAAACA 


AAAAGAATGG 


ACTTTCTGAA 


TAGCAAGTGA 


AAGAAGAGTA 


420 


ATTAAGAACG 


ATTACTGAAG 


AATAGGTTAA 


GTATTAAAAT 


TTAGTATTTA 


ACATGGACTA 


480 


CCAGTTAGAT 


TTAAATGAGA 


GTGGTGGCCA 


TAGAAGACAC 


AG AAG AGAAA 


CAGATTATGA 


540 


TACTGAAAAA 


TGGTTTGAAA 


TATCTCATGA 


CCAAAAAAAT 


TATGTATCAA 


TTTACGCCAA 


600 


CTAAAAGACA 


TCATATTGTT 


GGTGGCTTAA 


AGATTATTTT 


AATAAAAACA 


ATTATGATCA 


660 


TCTTAATGTA 


AGCATTAACA 


GACTAGAAAC 


TGAAGCCGAA 


TTCTATGCCT 


TTGATGATTT 


720 


TTCACAAACA 


ATCAAACTTA 


CTAATAATTC 


TTACTAGACT 


GTTAACATAG 


ACGTTAATTT 


780 


TGATAATAAT 


CTCTGTATAC 


TCGCATTGCT 


TAGATTTTTA 


TTATCACTAG 


AAAGATTCAA 


840 


TATTTTGAAT 


ATAAGATCTT 


CTTATACAAG 


AAATTAATAT 


AATTTTGAGA 


AAATTGGTGA 


900 


GCTACTTGAA 


ACTATCTTCG 


CAGTTGTCTT 


TTCTCATCGC 


CACTTACAAG 


GCATTCATTT 


960 


ACAAGTTCCT 


TGCGAAGCGT 


TCTAATATTT 


AGTTAACTCC 


TCATCATAAA 


TTAGCGTTAA 


1020 


AGATAGCTAA 


TTATAGGTAT 


ACTCTTTCTC 


TACAGACTTA 


AAATTAGTTG 


ACACTAACAA 


1080 


AGTCCAAGAT 


TATTTTAAGT 


TCTTATAAGA 


ATTCCCTCGT 


TTGACTCATG 


TAAGCTAGTA 


1140 


GGCTATCCCA 


GTTAGTGCTA 


CTAACGCTGT 


AGAGAACCTC 


AATGTTTTAC 


TTAAAAAGGT 


1200 


CAAGCATGCT 


AATCTTAATT 


TAGTTTCTAT 


CCCTACCTAA 


TTCAATTTTG 


ATTTCTACTT 


1260 


TGTTAATTTA 


TAACATTTGA 


AATTAGAGTT 


TGGATTAGAA 


C CAAATATTT 


TGACAAAACA 


1320 


AAAGCTTGAA 


AATCTACTTT 


TGAGTATAAA 


ATAATCAAAA 


AATCTTAAAT 


TTTTAAGATT 


1380 


AAACTTTTAC 


AC CTACGTTG 


CTTAAGAAAC 


CTCCAGAAAA 


CAGATATTAA 


AACAAGCTAC 


1440 


AACAATCAAA 


AATCTCAAAA 


ACAATAAAAA 


TCAAGAAGAA 


ACTCCTGAAA 


CTAAAGATGA 


1500 


AACTCCAAGC 


GAAAGCACAA 


GTGGTATGAA 


ATTTTTTGAT 


CATCTTTCTG 


AATTAACCGA 


1560 


GCTTGAAGAT 


TTCAGCGTTA 


ACTTGTAAGC 


TACCCAAGAA 


ATTTATGATA 


GCTTGCACAA 


1620 
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ACTTTTGATT 


AGATCAACAA 


ATTTAAAGAA 


GTTCAAATTA 


AGTTACAAAT 


ATGAAATGGA 


1680 


AAAGAGTAAA 


ATGGATACAT 


TCATAGATCT 


TAAGAATATT 


TATGAAACCT 


TAAACAATCT 


1740 


TAAAAGATGC 


TCTGTTAATA 


TATCAAATCC 


TCATGGAAAC 


ATTTCTTATG 


AACTGACAAA 


1800 


TAAAGATTCT 


ACTTTTTATA 


AATTTAAGCT 


GAC CTTAAAC 


TAAGAATTAT 


AACACGGTAA 


1860 


GTATAC T TT T 


AAGTAGAACG 


AATTTTAATT 


TAATAACGTT 


AAAAGTG CAA 


AAATTGAATC 


1920 


TTCCTGATTA 


GAAAGCTTAG 


AAGATATTGA 


TAGTCTTTGC 


AAATCTATTG 


CTTCTTGTAA 


1980 


AAATTTACAA 


AATGTTAATA 


TTATCGCCAG 


TTTGCTCTAT 


CCCAACAATA 


TTTAGAAAAA 


2040 


TCCTTTCAAT 


AAGCCCAATC 


TTCTATTTTT 


GAAGCAATTT 


GAATAATTGA 


AAAATTTGGA 


2100 


AAATGTATCT 


ATCAACTGTA 


TTCTTGATCA 


GCATATACTT 


AATTCTATTT 


CAGAATTCTT 


2160 


AGAAAAGAAT 


AAAAAAATAA 


AAGCATTCAT 


TTTGAAAAGA 


TATTATTTAT 


TACAATATTA 


2220 


TCTTGATTAT 


ACTAAATTAT 


TTAAAACACT 


TCAATAGTTA 


CCTGAATTAA 


ATTAAGTTTA 


2280 


CATTAATTAG 


CAATTAGAAG 


AATTGACTGT 


GAGTGAAGTA 


CATAAGTAAG 


TATGGGAAAA 


2340 


CCACAAGCAA 


AAAGCTTTCT 


ATGAACCATT 


ATGTGAGTTT 


ATCAAAGAAT 


CATCCTAAAC 


2400 


GCTTTAGCTA 


ATAGATTTTG 


ACCAAAACAC 


TGTAAGTGAT 


GACTCTATTA 


AAAAGATTTT 


2460 


AGAATCTATA 


TCTGAGTCTA 


AGTATCATCA 


TTATTTGAGA 


TTGAACCCTA 


GTTAATCTAG 


2520 


CAGTTTAATT 


AAATCTGAAA 


ACGAAGAAAT 


TTAAGAACTT 


CTCAAAGCTT 


GCGACGAAAA 


2580 


AGGTGTTTTA 


GTAAAAGCAT 


ACTATAAATT 


CCCTCTATGT 


T TAC L AAC TO? 


Q7IAL.TIATTA 


O £7 A A 

2o40 


CGATTACAAT 


TCAGATAGAT 


GGTGATTAAT 


TAAATATTAG 


TTTAAATAAA 


TATTAAATAT 


2700 


TGAATATTTC 


TTTGCTTATT 


ATTTGAATAA 


TACATACAAT 


AGTCATTTTT 


AGTGTTTTGA 


2760 


ATATATTTTA 


GTTATTTAAT 


TCATTATTTT 


AAGTAAATAA 


TTATTTTTCA 


ATCATTTTTT 


2820 


AAAAAATCG 












2829 


(2) INFORMATION FOR SEQ ID NO: 54 











(i) SEQUENCE CHARACTERISTICS i 

(A) LENGTH: 872 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 



(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 

Met Ser Arg Arg Asn Gin Lys Lys Pro Gin Ala Pro lie Gly Asn Glu 
1 5 10 15 

Thr Asn Leu Asp Phe Val Leu Gin Asn Leu Glu Val Tyr Lys Ser Gin 
20 25 30 

lie Glu His Tyr Lys Thr Gin Gin Gin Gin lie Lys Glu Glu Asp Leu 
35 40 45 
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Lys Leu Leu Lys Phe 
50 

Asp Asp Asp Asp Glu 
65 

Arg Arg Val Asn Gin 
85 

Gly Ser Lys Val Glu 
100 

Lys Asn Gly Leu Ser 
115 

He Thr Glu Glu Gin 
130 

Tyr Gin Leu Asp Leu 
145 

Glu Thr Asp Tyr Asp 
165 

Lys Asn Tyr Val Ser 
180 

Trp Leu Lys Asp Tyr 
195 

Ser He Asn Arg Leu 
210. 

Phe Ser Gin Thr He 
225 

He Asp Val Asn Phe 
245 

Phe Leu Leu Ser Leu 
260 

Tyr Thr Arg Asn Gin 
275 

Thr He Phe Ala Val 
290 

Leu Gin Val Pro Cys 
305 

Gin He Ser Val Lys 
325 

Asp Leu Lys Leu Val 
340 

Leu Gin Glu Phe Pro 
355 

Val Ser Ala Thr Asn 
370 



Lys Asn Gin Asp Gin Asp 
55 

Glu Asn Asn Ser Asn Lys 
70 75 

He Lys Gin Gin Val Gin 
90 

Lys Asp Leu Asn Leu Asn 
105 

Glu Gin Gin Val Lys Glu 
120 

Val Lys Tyr Gin Asn Leu 
135 

Asn Glu Ser Gly Gly His 
150 155 

Thr Glu Lys Trp Phe Glu 
170 

He Tyr Ala Asn Gin Lys 
185 

Phe Asn Lys Asn Asn Tyr 
200 

Glu Thr Glu Ala Glu Phe 
215 

Lys Leu Thr Asn Asn Ser 
230 235 

Asp Asn Asn Leu Cys He 
250 

Glu Arg Phe Asn lie Leu 
265 

Tyr Asn Phe Glu Lys He 
280 

Val Phe Ser His Arg His 
295 

Glu Ala Phe Gin Tyr Leu 
310 315 

Asp Ser Gin Leu Gin Val 
330 

Asp Thr Asn Lys Val Gin 
345 

Arg Leu Thr His Val Ser 
360 

Ala Val Glu Asn Leu Asn 
375 




Gly Asn Ser Gly Asn 
60 

Gin Gin Glu Leu Leu 
80 

Leu He Lys Lys Val 
95 

Glu Asp Glu Asn Lys 
110 

Glu Gin Leu Arg Thr 
125 

Val Phe Asn Met Asp 
140 

Arg Arg His Arg Arg 
160 

He Ser His Asp Gin 
175 

Thr Ser Tyr Cys Trp 
190 

Asp His Leu Asn Val 
205 

Tyr Ala Phe Asp Asp 
220 

Tyr Gin Thr Val Asn 
240 

Leu Ala Leu Leu Arg 
255 

Asn He Arg Ser Ser 
270 

Gly Glu Leu Leu Glu 
285 

Leu Gin Gly He His 
300 

Val Asn Ser Ser Ser 
320 

Tyr Ser Phe Ser Thr 
335 

Asp Tyr Phe Lys Phe 
350 

Gin Gin Ala He Pro 
365 

Val Leu Leu Lys Lys 
380 
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Val Lys His Ala Asn Leu Asn Leu Val Ser lie Pro Thr Gin Phe Asn 
385 390 395 400 

Phe Asp Phe Tyr Phe Val Asn Leu Gin His Leu Lys Leu Glu Phe Gly 
405 410 415 

Leu Glu Pro Asn lie Leu Thr Lys Gin Lys Leu Glu Asn Leu Leu Leu 
420 425 430 

Ser lie Lys Gin Ser Lys Asn Leu Lys Phe Leu Arg Leu Asn Phe Tyr 
435 440 445 

Thr Tyr Val Ala Gin Glu Thr Ser Arg Lys Gin lie Leu Lys Gin Ala 
450 455 460 

Thr Thr lie Lys Asn Leu Lys Asn Asn Lys Asn Gin Glu Glu Thr Pro 
465 470 475 480 

Glu Thr Lys Asp Glu Thr Pro Ser Glu Ser Thr Ser Gly Met Lys Phe 
485 490 495 

Phe Asp His Leu Ser Glu Leu Thr Glu Leu Glu Asp Phe Ser Val Asn 
500 505 510 

Leu Gin Ala Thr Gin Glu lie Tyr Asp Ser Leu His Lys Leu Leu lie 
515 520 525 

Arg Ser Thr Asn Leu Lys Lys Phe Lys Leu Ser Tyr Lys Tyr Glu Met 
530 535 540 

Glu Lys Ser Lys Met Asp Thr Phe lie Asp Leu Lys Asn lie Tyr Glu 
545 550 555 560 

Thr Leu Asn Asn Leu Lys Arg Cys Ser Val Asn lie Ser Asn Pro His 
565 570 575 

Gly Asn lie Ser Tyr Glu Leu Thr Asn Lys Asp Ser Thr Phe Tyr Lys 
580 585 590 

Phe Lys Leu Thr Leu Asn Gin Glu Leu Gin His Ala Lys Tyr Thr Phe 
595 600 605 

Lys Gin Asn Glu Phe Gin Phe Asn Asn Val Lys Ser Ala Lys lie Glu 
610 615 620 

Ser Ser Ser Leu Glu Ser Leu Glu Asp lie Asp Ser Leu Cys Lys Ser 
625 630 635 640 

lie Ala Ser Cys Lys Asn Leu Gin Asn Val Asn lie lie Ala Ser Leu 
645 650 655 

Leu Tyr Pro Asn Asn lie Gin Lys Asn Pro Phe Asn Lys Pro Asn Leu 
660 665 670 

Leu Phe Phe Lys Gin Phe Glu Gin Leu Lys Asn Leu Glu Asn Val Ser 
675 680 685 

lie Asn Cys lie Leu Asp Gin His He Leu Asn Ser He Ser Glu Phe 
690 695 700 

Leu Glu Lys Asn Lys Lys He Lys Ala Phe He Leu Lys Arg Tyr Tyr 



705 



710 



715 



720 
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Leu Leu Gin Tyr Tyr Leu Asp Tyr Thr Lys Leu Phe Lys Thr Leu Gin 
725 730 735 

Gin Leu Pro Glu Leu Asn Gin Val Tyr lie Asn Gin Gin Leu Glu Glu 
740 745 750 

Leu Thr Val Ser Glu Val His Lys Gin Val Trp Glu Asn His Lys Gin 
755 760 765 

Lys Ala Phe Tyr Glu Pro Leu Cys Glu Phe lie Lys Glu Ser Ser Gin 
770 775 780 

Thr Leu Gin Leu lie Asp Phe Asp Gin Asn Thr Val Ser Asp Asp Ser 
785 790 795 800 

lie Lys Lys He Leu Glu Ser He Ser Glu Ser Lys Tyr His His Tyr 
805 810 815 

Leu Arg Leu Asn Pro Ser Gin Ser Ser Ser Leu He Lys Ser Glu Asn 
820 825 830 

Glu Glu He Gin Glu Leu Leu Lys Ala Cys Asp Glu Lys , Gly Val Leu 
835 840 845 

Val Lys Ala Tyr Tyr Lys Phe Pro Leu Cys Leu Pro Thr Gly Thr Tyr 
850 855 860 

Tyr Asp Tyr Asn Ser Asp Arg Trp 
865 870 

(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 884 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 

Met Lys He Leu Phe Glu Phe He Gin Asp Lys Leu Asp He Asp Leu 
1 5 10 15 

Gin Thr Asn Ser Thr Tyr Lys Glu Asn Leu Lys Cys Gly His Phe Asn 
20 25 30 

Gly Leu Asp Glu He Leu Thr Thr Cys Phe Ala Leu Pro Asn Ser Arg 
35 40 45 

Lys He Ala Leu Pro Cys Leu Pro Gly Asp Leu Ser His Lys Ala Val 
50 55 60 

He Asp His Cys He lie Tyr Leu Leu Thr Gly Glu Leu Tyr Asn Asn 
65 70 75 80 

Val Leu Thr Phe Gly Tyr Lys He Ala Arg Asn Glu Asp Val Asn Asn 
85 90 95 

Ser Leu Phe Cys His Ser Ala Asn Val Asn Val Thr Leu Leu Lys Gly 
100 105 110 
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Ala Ala Trp Lys Met Phe His Ser Leu Val Gly Thr Tyr Ala Phe Val 
115 120 125 

Asp Leu Leu lie Asn Tyr Thr Val lie Gin Phe Asn Gly Gin Phe Phe 
130 135 140 

Thr Gin lie Val Gly Asn Arg Cys Asn Glu Pro His Leu Pro Pro Lys 
145 150 155 160 

Trp Val Gin Arg Ser Ser Ser Ser Ser Ala Thr Ala Ala Gin lie Lys 
165 170 175 

Gin Leu Thr Glu Pro Val Thr Asn Lys Gin Phe Leu His Lys Leu Asn 
180 185 190 

lie Asn Ser Ser Ser Phe Phe Pro Tyr Ser Lys lie Leu Pro Ser Ser 
195 200 205 

Ser Ser lie Lys Lys Leu Thr Asp Leu Arg Glu Ala lie Phe Pro Thr 
210 215 220 

Asn Leu Val Lys lie Pro Gin Arg Leu Lys Val Arg lie Asn Leu Thr 
225 230 235 240 

Leu Gin Lys Leu Leu Lys Arg His Lys Arg Leu Asn Tyr Val Ser lie 
245 250 255 

Leu Asn Ser lie Cys Pro Pro Leu Glu Gly Thr Val Leu Asp Leu Ser 
260 265 270 

His Leu Ser Arg Gin Ser Pro Lys Glu Arg Val Leu Lys Phe lie lie 
275 280 285 

Val He Leu Gin Lys Leu Leu Pro Gin Glu Met Phe Gly Ser Lys Lys 
290 295 300 

Asn Lys Gly Lys He He Lys Asn Leu Asn Leu Leu Leu Ser Leu Pro 
305 310 315 320 

Leu Ash Gly Tyr Leu Pro Phe Asp Ser Leu Leu Lys Lys Leu Arg Leu 
325 330 335 

Lys Asp Phe Arg Trp Leu Phe He Ser Asp lie Trp Phe Thr Lys His 
340 345 350 

Asn Phe Glu Asn Leu Asn Gin Leu Ala He Cys Phe He Ser Trp Leu 
355 360 365 

Phe Arg Gin Leu He Pro Lys He He Gin Thr Phe Phe Tyr Cys Thr 
370 375 380 

Glu He Ser Ser Thr Val Thr He Val Tyr Phe Arg His Asp Thr Trp 
385 390 395 400 

Asn Lys Leu He Thr Pro Phe He Val Glu Tyr Phe Lys Thr Tyr Leu 



Phe Asn His Ser Lys Met Arg He He Pro Lys Lys Ser Asn Asn Glu 
435 440 445 
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405 



410 



415 



Val Glu 



Asn Asn Val Cys Arg Asn His Asn Ser Tyr Thr Leu Ser Asn 
420 425 430 



Phe Arg lie lie Ala lie Pro Cys Arg Gly Ala Asp Glu Glu Glu Phe 
450 455 460 

Thr He Tyr Lys Glu Asn His Lys Asn Ala He Gin Pro Thr Gin Lys 
465 470 475 480 

He Leu Glu Tyr Leu Arg Asn Lys Arg Pro Thr Ser Phe Thr Lys He 
485 490 495 

Tyr Ser Pro Thr Gin He Ala Asp Arg He Lys Glu Phe Lys Gin Arg 
500 505 510 

Leu Leu Lys Lys Phe Asn Asn Val Leu Pro Glu Leu Tyr Phe Met Lys 
515 520 525 

Phe Asp Val Lys Ser Cys Tyr Asp Ser He Pro Arg Met Glu . Cys Met 
530 535 540 

Arg He Leu Lys Asp Ala Leu Lys Asn Glu Asn Gly Phe Phe Val Arg 
545 550 555 560 

Ser Gin Tyr Phe Phe Asn Thr Asn Thr Gly Val Leu Lys Leu Phe Asn 
565 570 575 

Val Val Asn Ala Ser Arg Val Pro Lys Pro Tyr Glu Leu Tyr He Asp 
580 585 590 

Asn Val Arg Thr Val His Leu Ser Asn Gin Asp Val He Asn Val Val 
595 600 605 

Glu Met Glu He Phe Lys Thr Ala Leu Trp Val Glu Asp Lys Cys Tyr 
610 615 620 

He Arg Glu Asp Gly Leu Phe Gin Gly Ser Ser Leu Ser Ala Pro He 
625 630 635 640 

Val Asp Leu Val Tyr Asp Asp Leu Leu Glu Phe Tyr Ser Glu Phe Lys 
645 650 655 

Ala Ser Pro Ser Gin Asp Thr Leu He Leu Lys Leu Ala Asp Asp Phe 
660 665 670 

Leu He He Ser Thr Asp Gin Gin Gin Val lie Asn He Lys Lys Leu 
675 680 685 

Ala Met Gly Gly Phe Gin Lys Tyr Asn Ala Lys Ala Asn Arg Asp Lys 
690 695 700 

He Leu Ala Val Ser Ser Gin Ser Asp Asp Asp Thr Val He Gin Phe 
705 710 715 720 

Cys Ala Met His He Phe Val Lys Glu Leu Glu Val Trp Lys His Ser 
725 730 735 

Ser Thr Met Asn Asn Phe His He Arg Ser Lys Ser Ser Lys Gly. He 
740 745 750 

Phe Arg Ser Leu He Ala Leu Phe Asn Thr Arg He Ser Tyr Lys Thr 
755 760 765 

He Asp Thr Asn Leu Asn Ser Thr Asn Thr Val Leu Met Gin He Asp 
770 775 780 
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His Val Val Lys Asn He Ser Glu Cys Tyr Lys Ser Ala Phe Lys Asp 
785 790 795 800 

Leu Ser He Asn Val Thr Gin Asn Met Gin Phe His Ser Phe Leu Gin 
805 810 815 

Arg He He Glu Met Thr Val Ser Gly Cys Pro He Thr Lys Cys Asp 
820 825 830 

Pro Leu He Glu Tyr Glu Val Arg Phe Thr He Leu Asn Gly Phe Leu 
835 840 845 

Glu Ser Leu Ser Ser Asn Thr Ser Lys Phe Lys Asp Asn He He Leu 
850 855 860 

Leu Arg Lys Glu He Gin His Leu Gin Ala Tyr He Tyr He Tyr He 
865 870 875 880 

His lie Val Asn 



(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc s "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 

YARACHAARG GHATYCCHYA RGG 

(2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /des.C = "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 

DGTDATNARN ARRTARTCRT C 

(2) INFORMATION FOR SEQ ID NO: 58: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 



(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 
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Leu Cys Val Ser Tyr lie Leu Ser Ser Phe Tyr Tyr Ala Asn Leu Glu 
15 10 15 



Glu Asn Ala Leu Gin Phe Leu Arg Lys Glu Ser Met Asp Pro Glu Lys 
20 25 30 

Pro Glu Thr Asn Leu Leu Met Arg Leu Thr 
35 40 

(2) INFORMATION FOR SEQ ID NO: 59: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 

(xl) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 

Leu Cys Val Ser Ser lie Leu Ser Ser Phe Tyr Tyr Ala Thr Leu Glu 
1 5 10 15 

Glu Ser Ser Leu Gly Phe Leu Arg Asp Glu Ser Met Asn Pro Glu Asn 
20 25 30 

Pro Asn Val Asn Leu Leu Met Arg Leu Thr 
35 40 

(2) INFORMATION FOR SEQ ID NO : 60: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 26 base pairs 
'(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "RNA" 

(ix) FEATURE: 

(A) NAME/KEY: modif ied_base 

(B) LOCATION: 12. .25 

(D) OTHER INFORMATION: /mod__base= OTHER 
/note= "The residues located at these positions are 
2-O-methylribonucleoti . . . " 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:60: 

TAGACCTGTT AGGUUUUGGG GUUUUG 

(2) INFORMATION FOR SEQ ID NO:61: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = " DNA" 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 
GGGGTTTTGG GGTTTT 16 
(2) INFORMATION FOR SEQ ID NO: 62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 389 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA (genomic) 

(ix) FEATURE : 

(A) NAME /KEY : - 

(B) LOCATION: 1..389 

(D) OTHER INFORMATION: /note= "expressed sequence tag (EST) 

AA281296" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:62: 
GCCAAGTTCC TGCACTGGCT GATGAGTGTG TACGTCGTCG AGCTGCTCAG GTCTTTCTTT 60 
TATGTCACGG AGACCACGTT TCAAAAGAAC AGGCTCTTTT TCTACCGGAA GAGTGTCTGG 12 0 

AGCAAGTTGC AAAGCATTGG AATCAGACAG CACTTGAAGA GGGTGCAGCT GCGGGACGTG 180 
TCGGAAGCAG AGGT CAGGCA GCATCGGGAA GCCAGGCCCG CCCTGCTGAC GTCCAGACTC 24 0 

CGCTTCATCC CCAAGCCTGA CGGGCTGCGG CCGATTGTGA ACATGGACTA CGTCGTGGGA 300 
GCCAGAACGT TCCGCAGAGA AAAGAGGGCC GAGCGTCTCA CCTCGAGGGT GAAGGCACTG 360 
TTCAGCGTGC TCAACTACGA GCGGGCGCG 38 9 

(2) INFORMATION FOR SEQ ID NO: 63: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 233 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME /KEY : Peptide 

(B) LOCATION: 1. .23 3 

(D) OTHER INFORMATION: /note- "TRT motifs from 

Schizosaccharomyces pombe tezl" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:63: 

lie Ser Glu lie Glu Trp Leu Val Leu Gly Lys Arg Ser Asn Ala Lys 
1 5 10 15 

Met Cys Leu Ser Asp Phe Glu Lys Arg Lys Gin He Phe Ala Glu Phe 
20 25 30 
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lie Tyr Trp Leu Tyr Asn Ser Phe lie lie Pro lie Leu Gin Ser Phe 
35 40 45 

Phe Tyr lie Thr Glu Ser Ser Asp Leu Arg Asn Arg Thr Val Tyr Phe 
50 55 60 

Arg Lys Asp lie Trp Lys Leu Leu Cys Arg Pro Phe lie Thr Ser Met 
65 70 75 80 

Lys Met Glu Ala Phe Glu Lys lie Asn Glu Asn Asn Val Arg Met Asp 
85 90 95 

Thr Gin Lys Thr Thr Leu Pro Pro Ala Val lie Arg Leu Leu Pro Lys 
100 105 110 

Lys Asn Thr Phe Arg Leu lie Thr Asn Leu Arg Lys Arg Phe Leu lie 
115 120 125 

Lys Met Gly Ser Asn Lys Lys Met Leu Val Ser Thr Asn Gin Thr Leu 
130 135 140 

Arg Pro Val Ala Ser lie Leu Lys His Leu He Asn Glu Glu Ser Ser 
145 150 155 160 

Gly He Pro Phe Asn Leu Glu Val Tyr Met Lys Leu Leu Thr Phe Lys 
165 170 175 

Lys Asp Leu Leu Lys His Arg Met Phe Gly Arg Lys Lys Tyr Phe Val 
180 185 190 

Arg He Asp He Lys Ser Cys Tyr Asp Arg He Lys Gin Asp Leu Met 
195 200 205 

Phe Arg He Val Lys Lys Lys Leu Lys Asp Pro Glu Phe Val He Arg 
210 215 220 

Lys Tyr Ala Thr He His Ala Thr Ser 
225 230 



(2) INFORMATION FOR SEQ ID NO: 64: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 23 3 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

( ii ) MOLECULE TYPE : peptide 



(ix) FEATURE: 

(A) NAME / KEY : Peptide 

(B) LOCATION: 1..233 

(D) OTHER INFORMATION: /note= "TRT motifs from Saccharomyces 

cerevisiae EST2 " 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 

Leu Lys Asp Phe Arg Trp Leu Phe He Ser Asp He Trp Phe Thr Lys 
1 5 10 15 
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His Asn Phe Glu Asn Leu Asn Gin Leu Ala lie Cys Phe lie Ser Trp 
20 25 30 

Leu Phe Arg Gin Leu lie Pro Lys lie lie Gin Thr Phe Phe Tyr Cys 
35 40 45 

Thr Glu lie Ser Ser Thr Val Thr lie Val Tyr Phe Arg His Asp Thr 
50 55 60 

Trp Asn Lys Leu lie Thr Pro Phe He Val Glu Tyr Phe Lys Thr Tyr 
65 70 75 80 

Leu Val Glu Asn Asn Val Cys Arg Asn His Asn Ser Tyr Thr Leu Ser 
85 90 95 

Asn Phe Asn His Ser Lys Met Arg He He Pro Lys Lys Ser Asn Asn 
100 105 110 

Glu Phe Arg He He Ala lie Pro Cys Arg Gly Ala Asp Glu Glu Glu 
115 120 125 

Phe Thr He Tyr Lys Glu Asn His Lys Asn Ala He Gin Pro Thr Gin 
130 135 140 

Lys He Leu Glu Tyr Leu Arg Asn Lys Arg Pro Thr Ser Phe Thr Lys 
145 150 155 160 

He Tyr Ser Pro Thr Gin He Ala Asp Arg He Lys Glu Phe Lys Gin 
165 170 175 

Arg Leu Leu Lys Lys Phe Asn Asn Val Leu Pro Glu Leu Tyr Phe Met 
180 185 190 

Lys Phe Asp Val Lys Ser Cys Tyr Asp Ser lie Pro Arg Met Glu Cys 
195 200 205 

Met Arg He Leu Lys Asp Ala Leu Lys Asn Glu Asn Gly Phe Phe Val 
210 215 220 

Arg Ser Gin Tyr Phe Phe Asn Thr Asn 
225 230 



(2) INFORMATION FOR SEQ ID NO : 65: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 233 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1„ .233 

(D) OTHER INFORMATION: /note= " TRT motifs from Euplotes 

aediculatus p!23" 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:65: 
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Thr Arg Glu lie Ser Trp Met Gin Val Glu Thr Ser Ala Lys His Phe 
1 5 10 15 

Tyr Tyr Phe Asp His Glu Asn lie Tyr Val Leu Trp Lys Leu Leu Arg 
20 25 30 

Trp lie Phe Glu Asp Leu Val Val Ser Leu lie Arg Cys Phe Phe Tyr 
35 40 45 

Val Thr Glu Gin Gin Lys Ser Tyr Ser Lys Thr Tyr Tyr Tyr Arg Lys 
50 55 60 

Asn He Trp Asp Val He Met Lys Met Ser He Ala Asp Leu Lys Lys 
65 70 75 80 

Glu Thr Leu Ala Glu Val Gin Glu Lys Glu Val Glu Glu Trp Lys Lys 
85 90 95 

Ser Leu Gly Phe Ala Pro Gly Lys Leu Arg Leu He Pro Lys Lys Thr 
100 105 110 

Thr Phe Arg Pro He Met Thr Phe Asn Lys Lys He Val Asn Ser Asp 
115 120 125 

Arg Lys Thr Thr Lys Leu Thr Thr Asn Thr Lys Leu Leu Asn Ser His 
130 135 140 

Leu Met Leu Lys Thr Leu Lys Asn Arg Met Phe Lys Asp Pro Phe Gly 
145 150 155 160 

Phe Ala Val Phe Asn Tyr Asp Asp Val Met Lys Lys Tyr Glu Glu Phe 
165 170 175 

Val Cys Lys Trp Lys Gin Val Gly Gin Pro Lys Leu Phe Phe Ala Thr 
180 185 190 

Met Asp lie Glu Lys' Cys Tyr Asp Ser Val Asn Arg Glu Lys Leu Ser 
195 200 205 

Thr Phe Leu Lys Thr Thr Lys Leu Leu Ser Ser Asp Phe Trp He Met 
210 215 220 

Thr Ala Gin lie Leu Lys Arg Lys Asn 
225 230 

(2) INFORMATION FOR SEQ ID NO: 66: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2631 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: lineair 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME /KEY : - 

(B) LOCATION: 1..2631 

(D) OTHER INFORMATION:, /note- " Saccharomyces cerevisiae 

EST2 " 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:66: 



ATTTATACTC 


ATGAAAATCT 


TATTCGAGTT 


CATTCAAGAC 


AAGCTTGACA 


TTGATCTACA 


60 


GACCAACAGT 


ACTTACAAAG 


AAAATTTAAA 


ATGTGGTCAC 


TTCAATGGCC 


TCGATGAAAT 


120 


TCTAACTACG 


TGTTTCGCAC 


TACCAAATTC 


AAGAAAAATA 


GCATTACCAT 


GCCTTCCTGG 


180 


TGACTTAAGC 


CACAAAGCAG 


TCATTGATCA 


CTGCATCATT 


TACCTGTTGA 


CGGGCGAATT 


240 


ATACAACAAC 


GTACTAACAT 


TTGGCTATAA 


AATAGCTAGA 


AATGAAGATG 


TCAACAATAG 


300 


TCTTTTTTGC 


CATTCTGCAA 


ATGTTAACGT 


TACGTTACTG 


AAAGGCGCTG 


CTTGGAAAAT 


360 


GTTCCACAGT 


TTGGTCGGTA 


CATACGCATT 


CGTTGATTTA 


TTGATCAATT 


ATACAGTAAT 


420 


TCAATTTAAT 


GGGCAGTTTT 


TCACTCAAAT 


CGTGGGTAAC 


AGATGTAACG 


AACCTCATCT 


480 


GCCGCCCAAA 


TGGGTCCAAC 


GATCATCCTC 


ATCATCCGCA 


ACTGCTGCGC 


AAATCAAACA 


540 


ACTTACAGAA 


CCAGTGACAA 


ATAAACAATT 


CTTACACAAG 


CTCAATATAA 


ATTCCTCTTC 


600 


TTTTTTTCCT 


TATAGCAAGA 


TCCTTCCTTC 


ATCATCATCT 


ATCAAAAAGC 


TAACTGACTT 


66 0 


GAGAGAAGCT 


ATTTTTCCCA 


CAAATTTGGT 


TAAAATTCCT 


CAGAGACTAA 


AGGTACGAAT 


720 


TAATTTGACG 


CTGCAAAAGC 


TATTAAAGAG 


ACATAAGCGT 


TTGAATTACG 


TTTCTATTTT 


780 


GAATAGTATT 


TGCCCACCAT 


TGGAAGGGAC 


CGTATTGGAC 


TTGTCGCATT 


TGAGTAGGCA 


840 


ATCAC CAAAG 


GAACGAGTCT 


TGAAATTTAT 


CATTGTTATT 


TTACAGAAGT 


TATTACCCCA 


900 


AGAAATGTTT 


GGCTCAAAGA 


AAAATAAAGG 


AAAAATTATC 


AAGAATCTAA 


ATCTTTTATT 


960 


AAGTTTACCC 


TTAAATGGCT 


ATTTACCATT 


TGATAGTTTG 


TTGAAAAAGT 


TAAGATTAAA 


1020 


GGATTTTCGG 


TGGTTGTTCA 


TTTCTGATAT 


TTGGTTCACC 


AAGCACAATT 


TTGAAAACTT 


1080 


GAATCAATTG 


GCGATTTGTT 


TCATTTCCTG 


GCTATTTAGA 


CAACTAATTC 


CCAAAATTAT 


1140 


ACAGACTTTT 


TTTTACTGCA 


CCGAAATATC 


TTCTACAGTG 


ACAATTGTTT 


ACTTTAGACA 


1200 


TGATACTTGG 


AATAAACTTA 


TCACCCCTTT 


TATCGTAGAA 


TATTTTAAGA 


CGTACTTAGT 


1260 


CGAAAACAAC 


GTATGTAGAA 


ACCATAATAG 


TTACACGTTG 


TCCAATTTCA 


ATCATAGCAA 


1320 


AATGAGGATT 


ATACCAAAAA 


AAAGTAATAA 


TGAGTTCAGG 


ATTATTGCCA 


TCCCATGCAG 


1380 


AGGGGCAGAC 


GAAGAAGAAT 


TCACAATTTA 


TAAGGAGAAT 


CACAAAAATG 


CTATCCAGCC 


1440 


CACTCAAAAA 


ATTTTAGAAT 


ACCTAAGAAA 


CAAAAGGCCG 


ACTAGTTTTA 


CTAAAATATA 


1500 


TTCTCCAACG 


CAAATAGCTG 


ACCGTATCAA 


AGAATTTAAG 


CAGAGACTTT 


TAAAGAAATT 


1560 


TAATAATGTC 


TTACCAGAGC 


TTTATTTCAT 


GAAATTTGAT 


GTCAAATCTT 


GCTATGATTC 


1620 


CAT AC C AAGG 


ATGGAATGTA 


TGAGGATACT 


CAAGGATGCG 


CTAAAAAATG 


AAAATGGGTT 


1680 


TTTCGTTAGA 


TCTCAATATT 


TCTTCAATAC 


CAATACAGGT 


GTATTGAAGT 


TATTTAATGT 


1740 


TGTTAACGCT 


AGCAGAGTAC 


CAAAACCTTA 


TGAGCTATAC 


ATAGATAATG 


TGAGGACGGT 


1800 




TCATTTATCA 


AATCAGGATG 


TTATAAACGT 


TGTAGAGATG 


GAAATATTTA 


AAACAGCTTT 


1860 


GTGGGTTGAA 


GATAAGTGCT 


ACATTAGAGA 


AGATGGTCTT 


TTTCAGGGCT 


CTAGTTTATC 


1920 


TGCTCCGATC 


GTTGATTTGG 


TGTATGACGA 


TCTTCTGGAG 


TTTTATAGCG 


AGTTTAAAGC 


1980 


CAGTCCTAGC 


CAGGACACAT 


TAATTTTAAA 


ACTGGCTGAC 


GATTTCCTTA 


TAATATCAAC 


2040 


AGACCAACAG 


CAAGTGATCA 


ATATCAAAAA 


GCTTGCCATG 


GGCGGATTTC 


AAAAATATAA 


2100 


TGCGAAAGCC 


AATAGAGACA 


AAATTTTAGC 


CGTAAGCTCC 


CAATCAGATG 


ATGATACGGT 


2160 


TATT C AATTT 


TGTGCAATGC 


ACATATTTGT 


TAAAGAATTG 


GAAGTTTGGA 


AACATTCAAG 


2220 


CACAATGAAT 


AATTTCCATA 


TCCGTTCGAA 


ATCTAGTAAA 


GGGATATTTC 


GAAGTTTAAT 


2280 


AGCGCTGTTT 


AACACTAGAA 


TCTCTTATAA 


AACAATTGAC 


ACAAATTTAA 


ATTCAACAAA 


2340 


CAGCGTTCTC 


ATGCAAATTG 


ATCATGTTGT 


AAAGAACATT 


TCGGAATGTT 


ATAAATCTGC 


2400 


TTTTAAGGAT 


CTAT CAATTA 


ATGTTACGCA 


AAATATGCAA 


TTTCATTCGT 


TCTTACAACG 


2460 


CATCATTGAA 


ATGACAGTCA 


GCGGTTGTCC 


AATTACGAAA 


TGTGATCCTT 


TAATCGAGTA 


2520 


TGAGGTACGA 


TTCAGCATAT 


TGAATGGATT 


TTTGGAAAGC 


CTATCTTCAA 


ACACATCAAA 


2580 


ATTTAAAGAT 


AATATCATTC 


TTTTGAGAAA 


GGAAATTCAA 


CACTTGCAAG 


C 


2631 



(2) INFORMATION FOR SEQ ID NO: 67: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE : 

(A) NAME /KEY : Peptide 

(B) LOCATION: 1. .129 

.(D) OTHER INFORMATION: /note* "TRT motifs from human" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67: 

Ala Lys Phe Leu His Trp Leu Met Ser Val Tyr Val Val Glu Leu Leu 
1 5 10 15 

Arg Ser Phe Phe Tyr Val Thr Glu Thr Thr Phe Gin Lys Asn Arg Leu 
20 25 30 

Phe Phe Tyr Arg Lys Ser Val Trp Ser Lys Leu Gin Ser lie Gly He 
35 40 45 

Arg Gin His Leu Lys Arg Val Gin Leu Arg Glu Leu Ser Glu Ala Glu 
50 55 60 

Val Arg Gin His Arg Glu Ala Arg Pro Ala Leu Leu Thr Ser Arg Leu 
65 70 75 80 
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Arg Phe lie Pro Lys Pro Asp Gly Leu Arg Pro lie Val Asn Met Asp 
85 90 95 

Tyr Val Val Gly Ala Arg Thr Phe Arg Arg Glu Lys Arg Ala Glu Arg 
100 105 110 

Leu Thr Ser Arg Val Lys Ala Leu Phe Ser Val Leu Asn Tyr Glu- Arg 
115 120 125 

Ala 

(2) INFORMATION FOR SEQ ID NO: 68 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5544 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE : 

(A) NAME /KEY: CDS 

(B) LOCATION: join (959 1216 , 1273.. 1353, 1425.. 1543, 

1595. .1857, 1894. .2286, 2326. .2396, 2436. .2705, 
2746. .2862, 2914. .3083, 3125. .3309, 3356. .3504, 
3546.. 3759, 3797.. 4046, 4086.. 4252, 4296.. 4392, 
4435.. 4597) 

(D) OTHER INFORMATION: /note- "Schizosaccharomyces pombe 

telomerase catalytic subun.it (TRT) " 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68: 

GGTACCGATT TACTTTCCTT TCTTCATAAG CTAATTGCTT CCTCGAACGC TCCTAAATCT 60 

CTGGAAATAT TTTTACAAGA ACTCAATAAC AATACCAAGT CAAATTCCAA TATGAAGGTG 120 

TTATTAGTGA TCGATAATAT TTCTATTTTA TCGGTCGTTA CCAAGTATAA GGACAAAAAG 18 0 

AACAACTTCC TTCCCCCTAA AGACTTTTAC TTTATTAATT TACTTTTCAA ATATATTTCG 24 0 

GGTTCGCTTA CTTTTAATCG TGGTACTGTT TTAGCTGCTA CTTCTAGCCA ACCGCGTGTT 300 

TCTACCCCGT CATTGGATAT AGCTCTTGGA GTAGCTCACA GAAATCCTTA CAAATCTTCT 36 0 

GATGAGACTA TATTAGATTC ATTACAGTCC GTGCATATTC TTAACATGGA GCCTTACACT 42 0 

TTAGATGAGT CACGTCGCAT GATGGAGTAT TTGGTAT CAT CCAACGTTTG CCTTGAAAAG 480 

GTTGATAATT ATTTGCAAAA TCATGTCCTT AGTGGTGGTA ATCCGCGAAA GTTTTTTGAT 54 0 

GCTTGCACAC GTCTAGCATG ATTGAGATAT TCAAAAATTT CTATC CACTA CAACTCCTTT 600 

AACGCGGTTT TATTTTTCTA TTTTCTATTC TCATGTTGTT CCAAATATGT ATCATCTCGT 660 

ATTAGGCTTT TTTCCGTTTT ACTCCTGGAA TCGTACCTTT TTCACTATTC CCCCTAATGA 72 0 

ATAATCTAAA TTAGTTTCGC TTATAATTGA TAGTAGTAGA AAGATTGGTG ATTCTACTCG 78 0 
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TGTAATGTTA TTAGTTTAAA GATACTTTGC AAAACATTTA TTAGCTATCA TTATATAAAA 

AAAATCCTAT AATTATAAAT ATTAATCAAT ATTTGCGGTC ACTATTTATT TAAAACGTTA 

TGATCAGTAG GACACTTTGC ATATATATAG TTATGCTTAA TGGTTACTTG TAACTTGC 

ATG ACC GAA CAC CAT ACC CCC AAA AGC AGG ATT CTT CGC TTT CTA GAG 
Met Thr Glu His His Thr Pro Lys Ser Arg lie Leu Arg Phe Leu Glu 
1 5 10 15 

AAT CAA TAT GTA TAC CTA TGT ACC TTA AAT GAT TAT GTA CAA CTT GTT 
Asn Gin Tyr Val Tyr Leu Cys Thr Leu Asn Asp Tyr Val Gin Leu Val 
20 25 30 

TTG AGA GGG TCG CCG GCA AGC TCG TAT AGC AAT ATA TGC GAA CGC TTG 
Leu Arg Gly Ser Pro Ala Ser Ser Tyr Ser Asn lie Cys Glu Arg Leu 
35 40 45 

AGA AGC GAT GTA CAA ACG TCC TTT TCT ATT TTT CTT CAT TCG ACT GTA 
Arg Ser Asp Val Gin Thr Ser Phe Ser lie Phe Leu His Ser Thr Val 
50 55 60 

GTC GGC TTC GAC AGT AAG CCA GAT GAA GGT GTT CAA TTT TCT TCT CCA 
Val Gly Phe Asp Ser Lys Pro Asp Glu Gly Val Gin Phe Ser Ser Pro 
65 70 75 80 

AAA TGC TCA GAG TCA GAG GTATATATAT TTTTGTTTTG ATTTTTTTCT 
Lys Cys Ser Gin Ser Glu 
85 

ATTCGGGATA GCTAATATAT GGGCAG CTA ATA GCG AAT GTT GTA AAA CAG ATG 

Leu lie Ala Asn Val Val Lys Gin Met 
90 95 

TTC GAT GAA AGT TTT GAG CGT CGA AGG AAT CTA CTG ATG AAA GGG TTT 
Phe Asp Glu Ser Phe Glu Arg Arg Arg Asn Leu Leu Met Lys Gly Phe 
100 105 110 

TCC ATG GTAAGGTATT CTAATTGTGA AATATTTACC TGCAATTACT GTTTCAAAGA 
Ser Met 



GATTGTATTT AACCGATAAA G AAT CAT GAA GAT TTT CGA GCC ATG CAT GTA 

Asn His Glu Asp Phe Arg Ala Met His Val 
115 120 

AAC GGA GTA CAA AAT GAT CTC GTT TCT ACT TTT CCT AAT TAC CTT ATA 
Asn Gly Val Gin Asn Asp Leu Val Ser Thr Phe Pro Asn Tyr Leu lie 
125 130 135 

TCT ATA CTT GAG TCA AAA AAT TGG CAA CTT TTG TTA GAA AT 
Ser lie Leu Glu Ser Lys Asn Trp Gin Leu Leu Leu Glu lie 
140 145 150 

GTAAATACCG GTTAAGATGT TGCGCACTTT G AAC AAGAC T GACAAGTATA G T ATC 

He 



GGC AGT GAT GCC ATG CAT TAC TTA TTA TCC AAA GGA AGT ATT TTT GAG 
Gly Ser Asp Ala Met His Tyr Leu Leu Ser Lys Gly Ser He Phe Glu 
■155 160 165 170 



840 
900 
958 
1006 

1054 

1102 

1150 

1198 

1246 

1299 

1347 

1403 

1454 

1502 

1543 

1598 

1646 
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GCT CTT CCA AAT GAC AAT TAC CTT CAG ATT TCT GGC ATA CCA CTT TTT 1694 
Ala Leu Pro Asn Asp Asn Tyr Leu Gin lie Ser Gly lie Pro Leu Phe 
175 180 185 

AAA AAT AAT GTG TTT GAG GAA ACT GTG TCA AAA AAA AGA AAG CGA ACC 1742 
Lys Asn Asn Val Phe Glu Glu Thr Val Ser Lys Lys Arg Lys Arg Thr 
190 195 200 

ATT GAA ACA TCC ATT ACT CAA AA.T AAA AGC GCC CGC AAA GAA GTT TCC 1790 
lie Glu Thr Ser lie Thr Gin Asn Lys Ser Ala Arg Lys Glu Val Ser 
205 210 215 

TGG AAT AGC ATT TCA ATT AGT AGG TTT AGC ATT TTT TAC AGG TCA TCC 1838 
Trp Asn Ser lie Ser lie Ser Arg Phe Ser lie Phe Tyr Arg Ser Ser 
220 225 230 

TAT AAG AAG TTT AAG CAA G GTAACTAATA CTGTTATCCT TCATAACTAA 188 7 

Tyr Lys Lys Phe Lys Gin 
235 240 

TTTTAG AT CTA TAT TTT AAC TTA CAC TCT ATT TGT GAT CGG AAC ACA 1934 
Asp Leu Tyr Phe Asn Leu His Ser lie Cys Asp Arg Asn Thr 
245 250 

GTA CAC ATG TGG CTT CAA TGG ATT TTT CCA AGG CAA TTT GGA CTT ATA 1982 
Val His Met Trp Leu Gin Trp lie Phe Pro Arg Gin Phe Gly Leu lie 
255 260 265 270 

AAC GCA TTT CAA GTG AAG CAA TTG CAC AAA GTG ATT CCA CTG GTA TCA 2 03 0 

Asn Ala Phe Gin Val Lys Gin Leu His Lys Val lie Pro Leu Val Ser 
275 280 285 

CAG AGT ACA GTT GTG CCC AAA CGT CTC CTA AAG GTA TAC CCT TTA ATT 2 07 8 

Gin Ser Thr Val Val Pro Lys Arg Leu Leu Lys Val Tyr Pro Leu lie 
290 295 300 

GAA CAA ACA GCA AAG CGA CTC CAT CGT ATT TCT CTA TCA AAA GTT TAC 2126 
Glu Gin Thr Ala Lys Arg Leu His Arg lie Ser Leu Ser Lys Val Tyr 
305 310 315 

AAC CAT TAT TGC CCA TAT ATT GAC ACC CAC GAT GAT GAA AAA ATC CTT 2174 
Asn His Tyr Cys Pro Tyr lie Asp Thr His Asp Asp Glu Lys lie Leu 
320 325 330 

AGT TAT TCC TTA AAG CCG AAC CAG GTG TTT GCG TTT CTT CGA TCC ATT 2222 
Ser Tyr Ser Leu Lys Pro Asn Gin Val Phe Ala Phe Leu Arg Ser lie 
335 340 345 350 

CTT GTT CGA GTG TTT CCT AAA TTA ATC TGG GGT AAC CAA AGG ATA TTT 227 0 

Leu Val Arg Val Phe Pro Lys Leu lie Trp Gly Asn Gin Arg lie Phe 
355 360 365 

GAG ATA ATA TTA AAA G GTATTGTATA AAATTTATTA CCACTAACGA TTTTACCAG AC 232 7 
Glu lie lie Leu Lys Asp 
370 

CTC GAA ACT TTC TTG AAA TTA TCG AGA TAC GAG TCT TTT AGT TTA CAT 2375 
Leu Glu Thr Phe Leu Lys Leu Ser Arg Tyr Glu Ser Phe Ser Leu His 
375 380 385 

TAT TTA ATG AGT AAC ATA AAG GTAATATGCC AAATTTTTTT ACCATTAATT 2426 
Tyr Leu Met Ser Asn lie Lys 
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390 395 

AACAATCAG ATT TCA GAA ATT GAA TGG CTA GTC CTT GGA AAA AGG TCA 24 74 

lie Ser Glu lie Glu Trp Leu Val Leu Gly Lys Arg Ser 
400 405 

AAT GCG AAA ATG TGC TTA AGT GAT TTT GAG AAA CGC AAG CAA ATA TTT 2522 
Asn Ala Lys Met Cys Leu Ser Asp Phe Glu Lys Arg Lys Gin lie Phe 
410 415 420 

GCG GAA TTC ATC TAC TGG CTA TAC AAT TCG TTT ATA ATA CCT ATT TTA 2 570 

Ala Glu Phe He Tyr Trp Leu Tyr Asn Ser Phe lie He Pro He Leu 
425 430 435 440 

CAA TCT TTT TTT TAT ATC ACT GAA TCA AGT GAT TTA CGA AAT CGA ACT 2618 
Gin Ser Phe Phe Tyr He Thr Glu Ser Ser Asp Leu Arg Asn Arg Thr 
445 450 455 

GTT TAT TTT AGA AAA GAT ATT TGG AAA CTC TTG TGC CGA CCC TTT ATT 2666 
Val Tyr Phe Arg Lys Asp He Trp Lys Leu Leu Cys Arg Pro Phe He 
460 465 470 

ACA TCA ATG AAA ATG GAA GCG TTT GAA AAA ATA AAC GAG GTATTTTAAA 2715 
Thr Ser Met Lys Met Glu Ala Phe Glu Lys He Asn Glu 
475 480 485 

GTATTTTTTG CAAAAAGCTA ATATTTTCAG AAC AAT GTT AGG ATG GAT ACT CAG 2769 

Asn Asn Val Arg Met Asp Thr Gin 
490 

AAA ACT ACT TTG CCT CCA GCA GTT ATT CGT CTA TTA CCT AAG AAG AAT 2817 
Lys Thr Thr Leu Pro Pro Ala Val He Arg Leu Leu Pro Lys Lys Asn 
495 500 505 

ACC TTT CGT CTC ATT ACG AAT TTA AGA AAA AGA TTC TTA ATA AAG 2862 
Thr Phe Arg Leu He Thr Asn Leu Arg Lys Arg Phe Leu He Lys 
510 515 520 

GTATTAATTT TTGGTCATCA ATGTACTTTA CTTCTAATCT ATTATTAGCA G ATG GGT 2 919 

Met Gly 
525 

TCA AAC AAA AAA ATG TTA GTC AGT ACG AAC CAA ACT TTA CGA CCT GTG 296 7 

Ser Asn Lys Lys Met Leu Val Ser Thr Asn Gin Thr Leu Arg Pro Val 
530 535 540 

GCA TCG ATA CTG AAA CAT TTA ATC AAT GAA GAA AGT AGT GGT ATT CCA 3 015 

Ala Ser He Leu Lys His Leu lie Asn Glu Glu Ser Ser Gly He Pro 
545 550 555 

TTT AAC TTG GAG GTT TAC ATG AAG CTT CTT ACT TTT AAG AAG GAT CTT 3 0 63 

Phe Asn Leu Glu Val Tyr Met Lys Leu Leu Thr Phe Lys Lys Asp Leu 
560 565 570 

CTT AAG CAC CGA ATG TTT GG GTAATTATAT AATGCGCGAT TCCTCATTAT 3113 
Leu Lys His Arg Met Phe Gly 
575 580 

TAATTTTGCA G G CGT AAG AAG TAT TTT GTA CGG ATA GAT ATA AAA TCC 3161 
Arg Lys Lys Tyr Phe Val . Arg He Asp He Lys Ser 
585 590 
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TGT TAT GAT CGA ATA AAG CAA GAT TTG ATG TTT CGG ATT GTT AAA AAG 
Cys Tyr Asp Arg lie Lys Gin Asp Leu Met Phe Arg lie Val Lys Lys 
595 600 605 

AAA CTC AAG GAT CCC GAA TTT GTA ATT CGA AAG TAT GCA ACC ATA CAT 
Lys Leu Lys Asp Pro Glu Phe Val lie Arg Lys Tyr Ala Thr lie His 
610 615 620 625 

GCA ACA AGT GAC CGA GCT ACA AAA AAC TTT GTT AGT GAG GCG TTT TCC 
Ala Thr Ser Asp Arg Ala Thr Lys Asn Phe Val Ser Glu Ala Phe Ser 
630 635 640 

TAT T GTAAGTTTAT TTTTTCATTG GAATTTTTTA ACAAATTCTT TTTTAG TT 
Tyr Phe 



GAT ATG GTG CCT TTT GAA AAA GTC GTG CAG TTA CTT TCT ATG AAA ACA 
Asp Met Val Pro Phe Glu Lys Val Val Gin Leu Leu Ser Met Lys Thr 
645 650 655 

TCA GAT ACT TTG TTT GTT GAT TTT GTG GAT TAT TGG ACC AAA AGT TCT 
Ser Asp Thr Leu Phe Val Asp Phe Val Asp Tyr Trp Thr Lys Ser Ser 
660 665 670 675 

TCT GAA ATT TTT AAA ATG CTC AAG GAA CAT CTC TCT GGA CAC ATT GTT 
Ser Glu lie Phe Lys Met Leu Lys Glu His Leu Ser Gly His lie Val 
680 685 690 

AAG GTATACCAAT TGTTGAATTG TAATAACACT AATGAAACTA G ATA GGA AAT 
Lys He Gly Asn 

695 

TCT CAA TAC CTT CAA AAA GTT GGT ATC CCT CAG GGC TCA ATT CTG TCA 
Ser Gin Tyr Leu Gin Lys Val Gly lie Pro Gin Gly Ser lie Leu Ser 
700 705 710 

TCT TTT TTG TGT CAT TTC TAT ATG GAA GAT TTG ATT GAT GAA TAC CTA 
Ser Phe Leu Cys His Phe Tyr Met Glu Asp Leu He Asp Glu Tyr Leu 
715 720 725 

TCG TTT ACG AAA AAG AAA GGA TCA GTG TTG TTA CGA GTA GTC GAC GAT 
Ser Phe Thr Lys Lys Lys Gly Ser Val Leu Leu Arg Val Val Asp Asp 
730 735 740 

TTC CTC TTT ATA ACA GTT AAT AAA AAG GAT GCA AAA AAA TTT TTG AAT 
Phe Leu Phe He Thr Val Asn Lys Lys Asp Ala Lys Lys Phe Leu Asn 
745 750 755 

TTA TCT TTA AGA G GTGAGTTGCT GTCATTCCTA AGTTCTAACC GTTGAAG GA 
Leu Ser Leu Arg Gly 
760 

TTT GAG AAA CAC AAT TTT TCT ACG AGC CTG GAG AAA ACA GTA ATA AAC 
Phe Glu Lys His Asn Phe Ser Thr Ser Leu Glu Lys Thr Val He Asn 
765 770 775 780 

TTT GAA AAT AGT AAT GGG ATA ATA AAC AAT ACT TTT TTT AAT GAA AGC 
Phe Glu Asn Ser Asn Gly He He Asn Asn Thr Phe Phe Asn Glu Ser 
785 790 795 

AAG AAA AGA ATG CCA TTC TTC GGT TTC TCT GTG AAC ATG AGG TCT CTT 
Lys Lys Arg Met Pro Phe Phe Gly Phe Ser Val Asn Met Arg Ser Leu 



3257 



3305 



3357 



3405 



3453 



3501 



3554 



3602 



3650 



3698 



3746 



3798 



3846 



3894 



3942 
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800 805 810 

GAT ACA TTG TTA GCA TGT CCT AAA ATT GAT GAA GCC TTA TTT AAC TCT 3990 
Asp Thr Leu Leu Ala Cys Pro Lys lie Asp Glu Ala Leu Phe Asn Ser 
815 820 825 

ACA TCT GTA GAG CTG ACG AAA CAT ATG GGG AAA TCT TTT TTT TAC AAA 4 038 

Thr Ser Val Glu Leu Thr Lys His Met Gly Lys Ser Phe Phe Tyr Lys 
830 835 840 

ATT CTA AG GTATACTGTG TAACTGAATA ATAGCTGACA AATAATCAG A TCG 4 08 9 

lie Leu Arg Ser 

845 

AGC CTT GCA TCC TTT GCA CAA GTA TTT ATT GAC ATT ACC CAC AAT TCA 4137 
Ser Leu Ala Ser Phe Ala Gin Val Phe lie Asp lie Thr His Asn Ser 
850 855 860 

AAA TTC AAT TCT TGC TGC AAT ATA TAT AGG CTA GGA TAC TCT ATG TGT 4185 
Lys Phe Asn Ser Cys Cys Asn lie Tyr Arg Leu Gly Tyr Ser Met Cys 
865 870 875 880 

ATG AGA GCA CAA GCA TAC TTA AAA AGG ATG AAG GAT ATA TTT ATT CCC 4233 
Met Arg Ala Gin Ala Tyr Leu Lys Arg Met Lys Asp lie Phe lie Pro 
885 890 895 

CAA AGA ATG TTC ATA ACG G GTGAGTACTT ATTTTAACTA GAAAAGTCAT 4282 
Gin Arg Met Phe lie Thr 
900 

TAATTAACCT TAG AT CTT TTG AAT GTT ATT GGA AGA AAA ATT TGG AAA 4330 
Asp Leu Leu Asn Val lie Gly Arg Lys lie Trp Lys 
905 910 

AAG TTG GCC GAA ATA TTA GGA TAT ACG AGT AGG CGT TTC TTG TCC TCT 43 78 

Lys Leu Ala Glu lie Leu Gly Tyr Thr Ser Arg Arg Phe Leu Ser Ser 
915 920 925 930 

GCA GAA GTC AAA TG GTACGTGTCG GTCTCGAGAC TTCAGCAATA TTGACACATC 4432 
Ala Glu Val Lys Trp 
935 

AG G CTT TTT TGT CTT GGA ATG AGA GAT GGT TTG AAA CCC TCT TTC AAA 44 80 

Leu Phe Cys Leu Gly Met Arg Asp Gly Leu Lys Pro Ser Phe Lys 
940 945 950 

TAT CAT CCA TGC TTC GAA CAG CTA ATA TAC CAA TTT CAG TCA TTG ACT 452 8 

Tyr His Pro Cys Phe Glu. Gin Leu He Tyr Gin Phe Gin Ser Leu Thr 
955 960 965 

GAT CTT ATC AAG CCG CTA AGA CCA GTT TTG CGA CAG GTG TTA TTT TTA 4576 
Asp Leu He Lys Pro Leu Arg Pro Val Leu Arg Gin Val Leu Phe Leu 
970 975 980 

CAT AGA AGA ATA GCT GAT TAATGTCATT TTCAATTTAT TATATACATC 4 624 

His Arg Arg He Ala Asp 
985 

CTTTATTACT GGTGTCTTAA ACAATATTAT T ACT AAGTAT AGCTGACCCC CAAAGCAAGC 4684 
ATACTATAGG ATTTCTAGTA AAGTAAAATT AATCTCGTTA TTAGTTTTGA TTGACTTGTC 4 744 
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TTTATCCTTA 


TACTTTTAAG 


AAAGATTGAC 


AGTGGTTGCT 


GACTACTGCC 


CACATGCCCA 


4804 


TTAAACGGGA 


GTGGTTAAAC 


ATTAAAAGTA 


ATACATGAGG 


CTAATCTCCT 


TTCATTTAGA 


4864 


ATAAGGAAAG 


TGGTTTTCTA 


TAATGAATAA 


TGCCCGCACT 


AATGCAAAAA 


GACGAAGATT 


4924 


ATCTTCTAAA 


CAAGGGGGAT 


TAAGCATATC 


CGAAGGAAAA 


GAGAGTAATA 


TACCCAGTGT 


4984 


TGTTGAAGAA 


AGCAAGGATA 


ATTTGGAACA 


AGCTTCTGCA 


GATGACAGGC 


TAAATTTTGG 


5044 


TGACCGAATT 


TTGGTAAAAG 


CCCCAGGTTA 


T'CCATGGTGG 


CCGGCCTTGC 


TACTGAGACG 


5104 


AAAAGAAACT 


AAGGATAGTT 


TGAATACTAA 


TAGCTCATTT 


AATGTCTTAT 


ATAAGGTTTT 


5164 


GTTTTTTCCT 


GACTTCAATT 


TTGCATGGGT 


GAAAAGAAAT 


AGTGTTAAGC 


CATTATTGGA 


5224 
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TTATGAGGCT 


TCAAAAACTC 


CTCCTGATTT 


AAAGGAGGAA 


TCTTCCACCG 


ATGAGGAAAT 


5344 


GGATAGCTTA 


TCAGCTGCTG 


AGGAGAAGCC 


TAATTTTTTG 


CAAAAAAGAA 


AATATCATTG 


5404 


GGAGACATCT 


CTTGATGAAT 


CAGATGCGGA 


GAGTATCTCC 


AGCGGATCCT 


TGATGTCAAT 


. 5464 


AACTTCTATT 


TCTGAAATGT 


ATGGTCCTAC 


TGTCGCTTCG 


ACTTCTCGTA 


GCTCTACGCA 


5524 


GTTAAGTGAC 


CAAAGGTACC 










5544 



(2) INFORMATION FOR SEQ ID NO: 69: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 98 8 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 69 : 

Met Thr Glu His His Thr Pro Lys Ser Arg lie Leu Arg Phe Leu Glu 
1 ■ 5 10 15 

Asn Gin Tyr Val Tyr Leu Cys Thr Leu Asn Asp Tyr Val Gin Leu Val 
20 25 30 

Leu Arg Gly Ser Pro Ala Ser Ser Tyr Ser Asn lie Cys Glu Arg Leu 
35 40 45 

Arg Ser Asp Val Gin Thr Ser Phe Ser He Phe Leu His Ser Thr Val 
50 55 60 

Val Gly Phe Asp Ser Lys Pro Asp Glu Gly Val Gin Phe Ser Ser Pro 
65 70 75 80 

Lys Cys Ser Gin Ser Glu Leu He Ala Asn Val Val Lys Gin Met Phe 
85 90 95 

Asp Glu Ser Phe Glu Arg Arg Arg Asn Leu Leu Met Lys Gly Phe Ser 
100 105 110 

Met Asn His Glu Asp Phe Arg Ala Met His Val Asn Gly Val Gin Asn 
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115 



120 



125 



Asp Leu Val Ser Thr Phe Pro Asn Tyr Leu lie Ser lie Leu Glu Ser 
130 135 140 

Lys Asn Trp Gin Leu Leu Leu Glu lie He Gly Ser Asp Ala Met His 
145 150 155 160 

Tyr Leu Leu Ser Lys Gly Ser lie Phe Glu Ala Leu Pro Asn Asp Asn 
165 170 175 

Tyr Leu Gin lie Ser Gly He Pro Leu Phe Lys Asn Asn Val Phe Glu 
180 185 190 

Glu Thr Val Ser Lys Lys Arg Lys Arg Thr He Glu Thr Ser He Thr 
195 200 205 

Gin Asn Lys Ser Ala Arg Lys Glu Val Ser Trp Asn Ser He Ser He 
210 215 220 

Ser Arg Phe Ser He Phe Tyr Arg Ser Ser Tyr Lys Lys Phe Lys Gin 
225 230 235 240 

Asp Leu Tyr Phe Asn Leu His Ser He Cys Asp Arg Asn Thr Val His 
245 250 255 

Met Trp Leu Gin Trp He Phe Pro Arg Gin Phe Gly Leu He Asn Ala 
260 265 270 

Phe Gin Val Lys Gin Leu His Lys Val He Pro Leu Val Ser Gin Ser 
275 280 285 

Thr Val Val Pro Lys Arg Leu Leu Lys Val Tyr Pro Leu He Glu Gin 
290 295 300 

Thr Ala Lys Arg Leu His Arg He Ser Leu Ser Lys Val Tyr Asn His 
305 310 315 320 

Tyr Cys Pro Tyr He Asp Thr His Asp Asp Glu Lys He Leu Ser Tyr 
325 330 335 

Ser Leu Lys Pro Asn Gin Val Phe Ala Phe Leu Arg Ser He Leu Val 
340 345 350 

Arg Val Phe Pro Lys Leu lie Trp Gly Asn Gin. Arg He Phe Glu He 
355 360 365 

He Leu Lys Asp Leu Glu Thr Phe Leu Lys Leu Ser Arg Tyr Glu Ser 
370 375 380 

Phe Ser Leu His Tyr Leu Met Ser Asn lie Lys He Ser Glu He Glu 
385 390 395 400 

Trp Leu Val Leu Gly Lys Arg Ser Asn Ala Lys Met Cys Leu Ser Asp 
405 410 415 

Phe Glu Lys Arg Lys Gin He Phe Ala Glu Phe He Tyr Trp Leu Tyr 
420 425 430 

Asn Ser Phe He He Pro He Leu Gin Ser Phe Phe Tyr He Thr Glu 



435 



440 



445 
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Ser Ser Asp Leu Arg 
450 

Lys Leu Leu Cys Arg 
465 

Glu Lys lie Asn Glu 
485 

Leu Pro Pro Ala Val 
500 

Leu lie Thr Asn Leu 
515 

Lys Lys Met Leu Val 
530 

lie Leu Lys His Leu 
545 

Leu Glu Val Tyr Met 
565 

His Arg Met Phe Gly 
580 

Ser Cys Tyr Asp Arg 
595 

Lys Lys Leu Lys Asp 
610 

His Ala Thr Ser Asp 
625 

Ser Tyr Phe Asp Met 
645 

Met Lys Thr Ser Asp 
660 

Lys Ser Ser Ser Glu 
675 

His lie Val Lys lie 
690 

Pro Gin Gly Ser lie 
705 

Asp Leu lie Asp Glu 
725 

Leu Leu Arg Val Val 
740 

Asp Ala Lys Lys Phe 
755 

Asn Phe Ser Thr Ser 

770 




Asn Arg Thr Val Tyr Phe 
455 

Pro Phe lie Thr Ser Met 
470 475 

Asn Asn Val Arg Met Asp 
490 

lie Arg Leu Leu Pro Lys 
505 

Arg Lys Arg Phe Leu lie 
520 

Ser Thr Asn Gin Thr Leu 
535 

lie Asn Glu Glu Ser Ser 
550 555 

Lys Leu Leu Thr Phe Lys 
570 

Arg Lys Lys Tyr Phe Val 
585 

lie Lys Gin Asp Leu Met 
600 

Pro Glu Phe Val lie Arg 
615 

Arg Ala Thr Lys Asn Phe 
630 635 

Val Pro Phe Glu Lys Val 
650 

Thr Leu Phe Val Asp Phe 
665 

lie Phe Lys Met Leu Lys 
680 

Gly Asn Ser Gin Tyr Leu 
'695 

Leu Ser Ser Phe Leu Cys 
710 715 

Tyr Leu Ser Phe Thr Lys 
730 

Asp Asp Phe Leu Phe lie 
745 

Leu Asn Leu Ser Leu Arg 
760 

Leu Glu Lys Thr Val lie 
775 



• 



Arg Lys Asp lie' Trp 
460 

Lys Met Glu Ala Phe 
480 

Thr Gin Lys Thr Thr 
495 

Lys Asn Thr Phe Arg 
510 

Lys Met Gly Ser Asn 
525 

Arg Pro Val Ala Ser 
540 

Gly lie Pro Phe Asn 
560 

Lys Asp Leu Leu Lys 
575 

Arg lie Asp He Lys 
590 

Phe Arg lie Val Lys 
605 

Lys Tyr Ala Thr lie 
620 

Val Ser Glu Ala Phe 
640 

Val Gin Leu Leu Ser 
655 



Val Asp Tyr Trp Thr 
670 

Glu His Leu Ser Gly 
685 

Gin Lys Val Gly He 
700 

His Phe Tyr Met Glu 
720 

Lys Lys Gly Ser Val 
735 

Thr Val Asn Lys Lys 
750 

Gly Phe Glu Lys His 
765 

Asn Phe Glu Asn Ser 
780 
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Asn Gly He He Asn Asn Thr Phe Phe Asn Glu Ser Lys Lys Arg Met 
785 790 795 800 

Pro Phe Phe Gly Phe Ser Val Asn Met Arg Ser Leu Asp Thr Leu Leu 
805 810 815 

Ala Cys Pro Lys He Asp Glu Ala Leu Phe Asn Ser Thr Ser Val Glu 
820 825 830 

Leu Thr Lys His Met Gly Lys Ser Phe Phe Tyr Lys He Leu Arg Ser 
835 840 845 

Ser Leu Ala Ser Phe Ala Gin Val Phe He Asp He Thr His Asn Ser 
850 855 860 

Lys Phe Asn Ser Cys Cys Asn He Tyr Arg Leu Gly Tyr Ser Met Cys 
865 870 875 880 

Met Arg Ala Gin Ala Tyr Leu Lys Arg Met Lys Asp He Phe He Pro 
885 890 895 

Gin Arg Met Phe He Thr Asp Leu -Leu Asn Val He Gly Arg Lys He 
900 905 910 

Trp Lys Lys Leu Ala Glu He Leu Gly Tyr Thr Ser Arg Arg Phe Leu 
915 920 925 

Ser Ser Ala Glu Val Lys Trp Leu Phe Cys Leu Gly Met Arg Asp Gly 
930 935 940 

Leu Lys Pro Ser Phe Lys Tyr His Pro Cys Phe Glu Gin Leu He Tyr 
945 950 955 960 

Gin Phe Gin Ser Leu Thr Asp Leu He Lys Pro Leu Arg Pro Val Leu 
965 970 975 

Arg Gin Val Leu Phe Leu His Arg Arg He Ala Asp 
980 985 



(2) INFORMATION FOR SEQ ID NO: 70: 

( i ) SEQUENCE CHARACTERISTICS : 
. (A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME /KEY: modif ied_base 

(B) LOCATION: 1 

(D) OTHER INFORMATION: /mod_base= OTHER 

/note= "N = guanos ine modified by 
biotin group" 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 70: 
NCCTATTTYT TYTAYNNNAC NGA 



61 



(2) INFORMATION FOR SEQ ID NO: 71: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 
.(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71: 

Phe Phe Tyr Xaa Thr Glu 
1 5 

(2) INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 
CCAGATATNA DNARRAARTC RTC 23 
(2) INFORMATION FOR SEQ ID NO: 73: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME /KEY : Modified- site 

(B) LOCATION: 5 

(D) OTHER INFORMATION: /products "OTHER" 

■/note- "Xaa = Phe, lie or Leu" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73: 

Asp Asp Phe Leu Xaa lie 
1 5 

(2) INFORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 
ACAATGMGNH TNHTNCCNAA RAA ♦ 23 

(2) INFORMATION FOR SEQ ID NO: 75: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE : 

(A) NAME /KEY: Modified- site 

(B) LOCATION: 2. .3 

(D) OTHER INFORMATION : /product^ "OTHER" 

/note= "Xaa = Leu or lie" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75: 

Arg Xaa Xaa Pro Lys Lys 
1 5 

(2) INFORMATION FOR SEQ ID NO: 76: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76: 
ACGAATCKNG GDATNSWRTC RTARCA 26 

(2) INFORMATION FOR SEQ ID NO : 77: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 
'(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77: 

Cys Tyr Asp Ser lie Pro Arg 
1 5 
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(2) INFORMATION FOR SEQ ID NO: 78: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78: 
CAATTCTCRT ARCANSWYTT DATRTC 



(2) INFORMATION FOR SEQ ID NO: 79: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79: 

Asp lie Lys Ser Cys Tyr Asp 
1 5 



(2) INFORMATION FOR SEQ ID NO: 80: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:80: 
GATTACTCCC GAAGAAAGGA TCTTTCCGTC CAATCATGAC TTTCTTAAGA AAGGACAAGC 
AAAAAAATAT TAAGTTAAAT CTAAATTAAA TTCTAATGGA TAGCCAACTT GTGTTTAGGA 
ATTTAAAAGA CATGCTGGGA TAAAAGATAG GATACTCAGT CTTTGATAAT AAACAAATTT 
CAGAAAAATT TGCCTAATTC ATAGAGAAAT GGAAAAATAA AGGAAGACCT CAGCTATATT 
ATGTCACTCT AGACATAAAG ACTTGCTAC 
(2) INFORMATION FOR SEQ ID NO: 81: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 474 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81: 

AAACACAAGG AAGGAAGTCA AATATTCTAT TACCGTAAAC CAATATGGAA ATTAGTGAGT 60 

AAATTAACTA TTGTCAAAGT AAGAATTTAG TTTTCTGAAA AGAATAAATA AATGAAAAAT 12 0 

AATTTTTATC AAAAAATTTA GCTTGAAGAG GAGAATTTGG AAAAAGTTGA AGAAAAATTG 180 

AT AC C AG AAG ATTCATTTTA GAAATACCCT CAAGGAAAGC TAAGGATTAT ACCTAAAAAA 240 

GGATCTTTCC GTCCAATCAT GACTTTCTTA AGAAAGGACA AGCAAAAAAA TATTAAGTTA 300 

AATCTAAATT AAATTCTAAT GGATAGCCAA CTTGTGTTTA GGAATTTAAA AGACATGCTG 360 

GGATAAAAGA TAGGATACTC AGTCTTTGAT AATAAACAAA TTTCAGAAAA ATTTGCCTAA 42 0 

TTCATAGAGA AATGGAAAAA TAAAGGAAGA CCTCAGCTAT ATTATGTCAC TCTA 474 
(2) INFORMATION FOR SEQ ID NO: 82: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 158 amino acids 

(B) TYPE: amino acid 
-(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82: 

Lys His Lys Glu Gly Ser Gin lie Phe Tyr Tyr Arg Lys Pro He Trp 
1 5 10 15 

Lys Leu Val Ser Lys Leu Thr He Val Lys Val Arg He Gin Phe Ser 
20 25 30 

Glu Lys Asn Lys Gin Met Lys Asn Asn Phe Tyr Gin Lys lie Gin Leu 
35 40 45 

Glu Glu Glu Asn Leu Glu . Lys Val Glu Glu Lys Leu He Pro Glu Asp 
50 55 60 

Ser Phe Gin Lys Tyr Pro Gin Gly Lys Leu Arg He lie Pro Lys Lys 
65 70 75 80 

Gly Ser Phe Arg Pro He Met Thr Phe Leu Arg Lys. Asp Lys Gin Lys 
85 90 95 

Asn He Lys Leu Asn Leu Asn Gin He Leu Met Asp Ser Gin Leu Val 
100 105 110 

Phe Arg Asn Leu Lys Asp Met Leu Gly Gin Lys He Gly Tyr Ser Val 
115 120 125 

Phe Asp Asn Lys Gin He Ser Glu Lys Phe Ala Gin Phe He Glu Lys 

65 



130 135 140 

Trp Lys Asn Lys Gly Arg Pro Gin Leu Tyr Tyr Val Thr Leu 
145 150 155 



(2) INFORMATION FOR SEQ ID NO: 83: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 157 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83: 

Phe Phe Tyr Cys Thr Glu lie Ser Ser Thr Val Thr lie Val Tyr Phe 
1 5 10 15 

Arg His Asp Thr Trp Asn Lys Leu He Thr Pro Phe He Val Glu Tyr 
20 25 30 

Phe Lys Thr Tyr Leu Val Glu Asn Asn Val Cys Arg Asn His Asn Ser 
35 40. 45 

Tyr Thr Leu Ser Asn Phe Asn His Ser Lys Met Arg He He Pro Lys 
50 55 60 

Lys Ser Asn Asn Glu Phe Arg He He Ala He Pro Cys Arg Gly Ala 
65 70 75 80 

Asp Glu Glu Glu Phe Thr He Tyr Lys Glu Asn His Lys Asn Ala He 
85 90 95 

Gin Pro Thr Gin Lys He Leu Glu Tyr Leu Arg Asn Lys Arg Pro Thr 
100 105 110 

Ser Phe Thr Lys He Tyr Ser Pro Thr Gin He Ala Asp Arg He Lys 
115 120 125 

Glu Phe Lys Gin Arg Leu Leu Lys Lys Phe Asn Asn Val Leu Pro Glu 
130 135 140 

Leu Tyr Phe Met Lys Phe Asp Val Lys Ser Cys Tyr Asp 
145 150 155 



(2) INFORMATION FOR SEQ ID NO: 84: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 155 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:84: 
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Phe Phe Tyr Val Thr Glu Gin Gin Lys Ser Tyr Ser Lys Thr Tyr Tyr 
15 10 15 

Tyr Arg Lys Asn lie Trp Asp Val lie Met Lys Met Ser lie Ala Asp 
20 25 30 

Leu Lys Lys Glu Thr Leu Ala Glu Val Gin Glu Lys Glu Val Glu Glu 
35 40 45 

Trp Lys Lys Ser Leu Gly Phe Ala Pro Gly Lys Leu Arg Leu lie Pro 
50 55 60 

Lys Lys Thr Thr Phe Arg Pro lie Met Thr Phe Asn Lys Lys He Val 
65 70 75 80 

Asn Ser Asp Arg Lys Thr Thr Lys Leu Thr Thr Asn Thr Lys Leu Leu 
85 90 95 

Asn Ser His Leu Met Leu Lys Thr Leu Lys Asn Arg Met Phe Lys Asp 
100 105 110 

Pro Phe Gly Phe Ala Val Phe Asn Tyr Asp Asp Val Met Lys Lys Tyr 
115 120 125 

Glu Glu Phe Val Cys Lys Trp Lys Gin Val Gly Gin Pro Lys Leu Phe 
130 135 140 

Phe Ala Thr Met Asp He Glu Lys Cys Tyr Asp 
145 150 155 

(2) INFORMATION FOR SEQ ID NO: 85: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 158 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85: 

Lys His Lys Glu Gly Ser Gin He Phe Tyr Tyr Arg Lys Pro He Trp 
1 5 10 15 

Lys Leu Val Ser Lys Leu Thr He Val Lys Val Arg He Gin Phe Ser 
20 25 30 

Glu Lys Asn Lys Gin Met Lys Asn Asn Phe Tyr Gin Lys He Gin Leu 
35 .40 45 

Glu Glu Glu Asn Leu Glu Lys Val Glu Glu Lys Leu He Pro Glu Asp 
50 55 60 

Ser Phe Gin Lys Tyr Pro Gin Gly Lys Leu Arg He He Pro Lys Lys 
65 70 75 80 

Gly Ser Phe Arg Pro He Met Thr Phe Leu Arg Lys Asp Lys Gin Lys 
85 90 95 
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Asn lie Lys Leu Asn Leu Asn Gin lie Leu Met Asp Ser Gin Leu Val 
100 105 110 

Phe Arg Asn Leu Lys Asp Met Leu Gly Gin Lys lie Gly Tyr Ser Val 
115 120 125 

Phe Asp Asn Lys Gin lie Ser Glu Lys Phe Ala Gin Phe lie Glu Lys 
130 135 140 

Trp Lys Asn Lys Gly Arg Pro Gin Leu Tyr Tyr Val Thr Leu 
145 150 155 

(2) INFORMATION FOR SEQ ID NO: 86: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1007 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:86: 

Glu Val Asp Val Asp Asn Gin Ala Asp Asn His Gly lie His Ser Ala 
1 5 10 15 

Leu Lys Thr Cys Glu Glu lie Lys Glu Ala Lys Thr Leu Tyr Ser Trp 
20 25 30 

lie Gin Lys Val lie Arg Cys Arg Asn Gin Ser Gin Ser His Tyr Lys 
35 40 45 

Asp Leu Glu Asp lie Lys lie Phe Ala Gin Thr Asn lie Val Ala Thr 
50 55 60 

Pro Arg Asp Tyr Asn Glu Glu Asp Phe Lys Val lie Ala Arg Lys Glu 
65 70 75 80 

Val Phe Ser Thr Gly Leu Met lie Glu Leu lie Asp Lys Cys Leu Val 
85 90 95 

Glu Leu Leu Ser Ser Ser Asp Val Ser Asp Arg Gin Lys Leu Gin Cys 
100 105 110 

Phe Gly Phe Gin Leu Lys Gly Asn Gin Leu Ala Lys Thr His Leu Leu 
115 120 125 

Thr Ala Leu Ser Thr Gin Lys Gin Tyr Phe Phe Gin Asp Glu Trp Asn 
130 135 140 

Gin Val Arg Ala Met lie Gly Asn Glu Leu Phe Arg His Leu Tyr Thr 
145 . 150 155 160 

Lys Tyr Leu lie Phe Gin Arg Thr Ser Glu Gly Thr Leu Val Gin Phe 
165 170 175 

Cys Gly Asn Asn Val Phe Asp His Leu Lys Val Asn Asp Lys Phe Asp 
180 185 190 
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Lys Lys Gin Lys Gly Gly Ala Ala Asp Met Asn Glu Pro Arg Cys Cys 
195 200 205 

Ser Thr Cys Lys Tyr Asn Val Lys Asn Glu Lys Asp His Phe Leu Asn 
210 215 220 

Asn lie Asn Val Pro Asn Trp Asn Asn Met Lys Ser Arg Thr Arg lie 
225 230 235 240 

Phe Tyr Cys Thr His Phe Asn Arg Asn Asn Gin Phe Phe Lys Lys His 
245 250 255 

Glu Phe Val Ser Asn Lys Asn Asn lie Ser Ala Met Asp Arg Ala Gin 
260 265 270 

Thr lie Phe Thr Asn lie Phe Arg Phe Asn Arg He Arg Lys Lys Leu 
275 280 285 

Lys Asp Lys Val lie Glu Lys lie Ala Tyr Met Leu Glu Lys Val Lys 
290 295 300 

Asp Phe Asn Phe Asn Tyr Tyr Leu Thr Lys Ser Cys Pro Leu Pro Glu 
305 310 315 320 

Asn Trp Arg Glu Arg Lys Gin Lys lie Glu Asn Leu lie Asn Lys Thr 
325 330 335 

Arg Glu Glu Lys Ser Lys Tyr Tyr Glu Glu Leu Phe Ser Tyr Thr Thr 
340 345 350 

Asp Asn Lys Cys Val Thr Gin Phe lie Asn Glu Phe Phe Tyr Asn He 
355 360 365 

Leu Pro Lys Asp Phe Leu Thr Gly Arg Asn Arg Lys Asn Phe Gin Lys 
370 375 . 380 

Lys Val Lys Lys Tyr Val Glu Leu Asn Lys His Glu Leu He His Lys 
385 390 395 400 

Asn Leu Leu Leu Glu Lys He Asn Thr Arg Glu He Ser Trp Met Gin 
405 410 415 

Val Glu Thr Ser Ala Lys His Phe Tyr Tyr Phe Asp His Glu Asn He 
420 425 430 

Tyr Val Leu Trp Lys Leu Leu Arg -Trp He Phe Glu Asp Leu Val Val 
435 . 440 445 

Ser Leu He Arg Cys Phe Phe Tyr Val Thr Glu Gin Gin Lys Ser Tyr 
450 455 460 

Ser Lys Thr Tyr Tyr Tyr Arg Lys Asn He Trp Asp Val He Met Lys 
465 470 475 480 

Met Ser He Ala Asp Leu Lys Lys Glu Thr Leu Ala Glu Val Gin Glu 
485 490 495 

Lys Glu. Val Glu Glu Trp Lys Lys Ser Leu Gly Phe Ala Pro Gly Lys 
500 505 510 

Leu Arg Leu He Pro Lys Lys Thr Thr Phe Arg Pro He Met Thr Phe 
515 520 525 
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Asn Lys Lys lie Val 
530 

Asn Thr Lys Leu Leu 
545 

Arg Met Phe Lys Asp 
565 

Val Met Lys Lys Tyr 
580 

Gin Pro Lys Leu Phe 
595 

Ser Val Asn Arg Glu 
610 

Leu Ser Ser Asp Phe 
625 

Asn Asn lie Val lie 
645 

Asp Tyr Phe Arg Gin 
660 

Tyr Pro Thr Leu Phe 
675 

Ala Lys Lys Thr Leu 
690 

Lys Asp Asn Leu Leu 
705 

lie Asn Phe Asn Gly 
725 

Gly Leu Cys Val Ser 
740 

Glu Glu Ser Ser Leu 
755 

Asn Pro Asn Val Asn 
770 

lie Thr Thr Gin Glu 
785 

Asn Val Ser Arg Glu 
805 

Thr Ser Phe Pro Leu 
820 

Ser Val Glu Glu Gin 
835 

lie Ser lie Asp Met 
850 



Asn Ser Asp Arg Lys Thr 
535 

Asn Ser His Leu Met Leu 
550 555 

Pro Phe Gly Phe Ala Val 
570 

Glu Glu Phe Val Cys Lys 
585 

Phe Ala Thr Met Asp lie 
600 

Lys Leu Ser Thr Phe Leu 
615 

Trp lie Met Thr Ala Gin 
630 635 

Asp Ser Lys Asn Phe Arg 
650 

Lys Phe Gin Lys lie Ala 
665 

Ser Val Leu Glu Asn Glu 
680 

lie Val Glu Ala Lys Gin 
695 

Gin Pro Val lie Asn He 
710 715 

Lys Phe Tyr Lys Gin Thr 
730 

Ser He Leu Ser Ser Phe 
745 

Gly Phe Leu Arg Asp Glu 
760 

Leu Leu Met Arg Leu Thr 

775 

Asn Asn Ala Val Leu Phe 
790 795 

Asn Gly Phe Lys Phe Asn 
810 

Ser Pro Ser Lys Phe Ala 
825 

Asn He Val Gin Asp Tyr 
840 

Lys Thr Leu Ala Leu Met 
855 



Thr Lys Leu Thr Thr 
540 

Lys Thr Leu Lys Asn 
560 

Phe Asn Tyr Asp Asp 
575 

Trp Lys Gin Val Gly 
590 

Glu Lys Cys Tyr Asp 
605 

Lys Thr Thr Lys Leu 
620 

He Leu Lys Arg Lys 
640 

Lys Lys Glu Met Lys 
655 

Leu Glu Gly Gly Gin 
670 

Gin Asn Asp Leu Asn 
685 

Arg Asn Tyr Phe Lys 
700 

Cys Gin Tyr Asn Tyr 
720 

Lys Gly He Pro Gin 
735 

Tyr Tyr Ala Thr Leu 
750 

Ser Met Asn Pro Glu 
765 

Asp Asp Tyr Leu Leu 

780 

He Glu Lys Leu He 
800 

Met Lys Lys Leu Gin 
815 

Lys Tyr Gly Met Asp 
830 

Cys Asp Trp He Gly 
845 

Pro Asn He Asn Leu 
860 
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Arg lie Glu Gly lie Leu Cys Thr Leu Asn Leu Asn Met Gin Thr Lys 
865 870 875 880 

Lys Ala Ser Met Trp Leu Lys Lys Lys Leu Lys Ser Phe Leu Met Asn 
885 890 895 

Asn lie Thr His Tyr Phe Arg Lys Thr lie Thr Thr Glu Asp Phe Ala 
900 905 910 

Asn Lys Thr Leu Asn Lys Leu Phe He Ser Gly Gly Tyr Lys, Tyr Met 
915 920 925 

Gin Cys Ala Lys Glu Tyr Lys Asp His Phe Lys Lys Asn Leu Ala Met 
930 935 940 

Ser Ser Met He Asp Leu Glu Val Ser Lys lie He Tyr Ser Val Thr 
945 950 955 960 

Arg Ala Phe Phe Lys Tyr Leu Val Cys Asn He Lys Asp Thr He Phe 
965 970 975 

Gly Glu Glu His Tyr Pro Asp Phe Phe Leu Ser Thr Leu Lys His Phe 
980 985 990 

He Glu He Phe Ser Thr Lys Lys Tyr He Phe Asn Arg Val Cys 
995 1000 1005 

(2) INFORMATION FOR SEQ ID NO: 87: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 19 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87: 
GTGAAGGCAC TGTTCAGCG 19 
(2) INFORMATION FOR SEQ ID NO: 88: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88: 

GTGGATGATT TCTTGTTGG 19 

(2) INFORMATION FOR SEQ ID NO: 89: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 19 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 89 
ATGCTCCTGC GTTTGGTGG 



(2) INFORMATION FOR SEQ ID NO: 90: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90 
CTGGACACTC AGCCCTTGG 
(2) INFORMATION FOR SEQ ID NO: 91: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91 
GGCAGGTGTG CTGGACACT 



(2) INFORMATION FOR SEQ ID NO: 92: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 92 
TTTGATGATG CTGGCGATG 



(2) INFORMATION FOR SEQ ID NO: 93: 
(i) SEQUENCE CHARACTERISTICS : 
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(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93: 
GGGGCTCGTC TTCTACAGG 



(2) INFORMATION FOR SEQ ID NO: 94: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNE SS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94 
CAGCAGGAGG ATCTTGTAG 



(2) INFORMATION FOR SEQ ID NO: 95: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNE SS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95 
TGACCCCAGG AGTGGCACG 



(2) INFORMATION FOR SEQ ID NO: 96: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96 
TCAAGCTGAC TCGACACCG 



(2) INFORMATION FOR SEQ ID NO: 97: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 97: 
CGGCGTGACA GGGCTGC 



(2) INFORMATION FOR SEQ ID NO: 98: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98: 
GCTGAAGGCT GAGTGTCC 



(2) INFORMATION FOR SEQ ID NO: 99: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99: 
TAGTC CATGT TCACAATCG 



(2) INFORMATION FOR SEQ ID NO:100: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 2171 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 22.. 1716 

(D) OTHER INFORMATION: /note= "EcoRI-NotI insert of 

clone 712562 encoding 63 kDa 
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telomerase. protein" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100: 



GCCAAGTTCC 


TGCACTGGCT 


GATGAGTGTG 


TACGTCGTCG 


AGCTGCTCAG 


GTCTTTCTTT 


60 


TATGTCACGG 


AGACCACGTT 


TCAAAAGAAC 


AGGCTCTTTT 


TCTACCGGAA 


GAGTGTCTGG 


12 0 


AGCAAGTTGC 


AAAGCATTGG 


AATCAGACAG 


CACTTGAAGA 


GGGTGCAGCT 


GCGGGAGCTG 


180 


TCGGAAGCAG 


AGGTCAGGCA 


GCATCGGGAA 


GCCAGGCCCG 


CCCTGCTGAC 


GTCCAGACTC 


240 


CGCTTCATCC 


CCAAGCCTGA 


CGGGCTGCGG 


CCGATTGTGA 


ACATGGACTA 


CGTCGTGGGA 


300 


GCCAGAACGT 


TCCGCAGAGA 


AAAGAKGGCC 


GAGCGTCTCA 


CCTCGAGGGT 


GAAGGCACTG 


360 


TTCAGCGTGC 


TCAACTACGA 


GCGGGCGCGG 


CGCCCCGGCC 


TCCTGGGCGC 


CTCTGTGCTG 


420 


GGCCTGGACG 


ATATCCACAG 


GGCCTGGCGC 


ACCTTCGTGC 


TGCGTGTGCG 


GGCCCAGGAC 


480 


CCGCCGCCTG 


AGCTGTACTT 


TGTCAAGGTG 


GATGTGACGG 


GCGCGTACGA. 


CACCATCCCC 


540 


CAGGACAGGC 


TCACGGAGGT 


CATCGCCAGC 


ATCATCAAAC 


CCCAGAACAC 


GTACTGCGTG 


600 


CGTCGGTATG 


CCGTGGTCCA 


GAAGGCCGCC 


ATGGGCACGT 


CCGCAAGGCC 


TTCAAGAGCC 


660 


ACGTCCTACG 


TCCAGTGCCA 


GGGGATCCCG 


CAGGGCTCCA 


TCCTCTCCAC 


GCTGCTCTGC 


720 


AGCCTGTGCT 


ACGGCGACAT 


GGAGAACAAG 


CTGTTTGCGG 


GGATTCGGCG 


GGACGGGCTG 


780 


CTCCTGCGTT 


TGGTGGATGA 


TTTCTTGTTG 


GTGACACCTC 


ACCTCACCCA 


CGCGAAAACC 


840 


TTCCTCAGGA 


CCCTGGTCCG 


AGGTGTCCCT 


GAGTATGGCT 


GCGTGGTGAA 


CTTGCGGAAG 


900 


AC AGTGGTGA 


ACTTCCCTGT 


AGAAGACGAG 


GCCCTGGGTG 


GCACGGCTTT 


TGTTCAGATG 


960 


CCGGCCCACG 


GCCTATTCCC 


CTGGTGCGGC 


CTGCTGCTGG 


ATACCCGGAC 


CCTGGAGGTG 


1020 


CAGAGCGACT 


ACTCCAGCTA 


TGCCCGGACC 


TCCATCAGAG 


CCAGTCTCAC 


CTTCAACCGC 


1080 


GGCTTCAAGG 


CTGGGAGGAA 


CATGCGTCGC 


AAACTCTTTG 


GGGTCTTGCG 


GCTGAAGTGT 


1140 


CACAGCCTGT 


TTCTGGATTT 


GCAGGTGAAC 


AGCCTCCAGA 


CGGTGTGCAC 


CAACATCTAC 


1200 


AAGATCCTCC 


TGCTGCAGGC 


GTACAGGTTT 


CACGCATGTG 


TGCTGCAGCT 


CCCATTTCAT 


1260 


CAGCAAGTTT 


GGAAGAACCC 


ACATTTTTCC 


TGCGCGTCAT 


CTCTGACACG 


GCTCCCTCTG 


1320 


CTACTCCATC 


CTGAAAGCCA 


AGAACGCAGG 


GATGTCGCTG 


GGGGCCAAGG 


GCGCCGCCGG 


1380 


CCCTCTGCCC 


TCCGAGGCCG 


TGCAGTGGCT 


GTGCCACCAA 


GCATTCCTGC 


TCAAGCTGAC 


1440 


TCGACACCGT 


GTCACCTACG 


TGCCACTCCT 


GGGGTCACTC 


AGGACAGCCC 


AGACGCAGCT 


1500 


GAGT CGGAAG 


CTCCCGGGGA 


CGACGCTGAC 


TGCCCTGGAG 


GCCGCAGCCA 


ACCCGGCACT 


1560 


GCCCTCAGAC 


TTCAAGACCA 


TCCTGGACTG 


ATGGCCACCC 


GCCCACAGCC 


AGGCCGAGAG 


1620 


CAGACACCAG 


CAGCCCTGTC 


ACGCCGGGCT 


TATACGTCCC 


AGGGAGGGAG 


GGGCGGCCCA 


1680 


CACCCAGGCC 


TGCACCGCTG 


GGAGTCTGAG 


GCCTGAGTGA 


GTGTTTGGCC 


GAGGCCTGCA 


1740 
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TGTCCGGCTG AAGGCTGAGT GTCCGGCTGA GGCCTGAGCG AGTGTCCAGC CAAGGGCTGA 18 00 

GTGTCCAGCA CACCTGCGTT TTCACTTCCC CACAGGCTGG CGTTCGGTCC ACCCCAGGGC 186 0 

CAGCTTTTCC TCACCAGGAG CCCGGCTTCC ACTCCCCACA TAGGAATAGT CCATCCCCAG 1920 

ATTCGCCATT GTTCACCCTT CGCCCTGCCT TCCTTTGCCT TCCACCCCCA CCATTCAGGT 1980 

GGAGACCCTG AGAAGGACCC TGGGAGCTTT GGGAATTTGG AGTGACCAAA GGTGTGCCCT 2040 

GTACACAGGC GAGGACCCTG CACCTGGATG GGGGTCCCTG TGGGTCAAAT TGGGGGGAGG 2100 

TGCTGTGGGA GTAAAATACT GAATATATGA GTTTTTCAGT TTTGGAAAAA AAAAAAAAAA 2160 

AAAAAAAAAA A 2171 

(2) INFORMATION FOR SEQ ID NO: 101: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 564 amino acids 

(B) TYPE; amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear . 

(ii) MOLECULE TYPE: protein 

(ix) FEATURE: 

(A) NAME /KEY: Protein 

(B) LOCATION: 1. .564 

(D) OTHER INFORMATION: /note= "63 kDa telomerase protein 

encoded by ORF of EcoRI-NotI insert of 
clone 712562" 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 101: 

Met Ser Val Tyr Val Val Glu Leu Leu Arg Ser Phe Phe Tyr Val Thr 
1 5 10 15 

Glu Thr Thr Phe Gin Lys Asn Arg Leu Phe Phe Tyr Arg Lys Ser Val 
20 25 30 

Trp Ser Lys Leu Gin Ser lie Gly lie Arg Gin His Leu Lys Arg Val 
35 40 45 

Gin Leu Arg Glu Leu Ser Glu Ala Glu Val Arg Gin His Arg Glu Ala 
50 55 60 

Arg Pro Ala Leu Leu Thr Ser Arg Leu Arg Phe lie Pro Lys Pro Asp 
65 70 75 80 

Gly Leu Arg Pro lie Val Asn Met Asp Tyr Val Val Gly Ala Arg Thr 
85 90 95 

Phe Arg Arg Glu Lys Xaa Ala Glu Arg Leu Thr Ser Arg Val Lys Ala 
100 105 110 

Leu Phe Ser Val Leu Asn Tyr Glu Arg Ala Arg Arg Pro Gly Leu Leu 
115 120 125 
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Gly Ala Ser Val Leu 
130 

Phe Val Leu Arg Val 
145 

Val Lys Val Asp Val 
165 

Leu Thr Glu Val lie 
180 

Val Arg Arg Tyr Ala 
195 

Arg Pro Ser Arg Ala 
210 

Gly Ser lie Leu Ser 
225 

Glu Asn Lys Leu Phe 
245 

Leu Val Asp Asp Phe 
260 

Thr Phe Leu Arg Thr 
275 

Val Asn Leu Arg Lys 
290 

Leu Gly Gly Thr Ala 
305 

■> 

Trp Cys Gly Leu Leu 
325 

Tyr Ser Ser Tyr Ala 
340 

Arg Gly Phe Lys Ala 
355 

Leu Arg Leu Lys Cys 
370 



Leu Gin Thr Val Cys 
385 

Tyr Arg Phe His Ala 
405 

Trp Lys Asn Pro His 
420 

Leu Leu Leu His Pro 
435 

Gin Gly Arg Arg Arg 
450 



Gly Leu Asp Asp lie His 
135 

Arg Ala Gin Asp Pro Pro 
150 155 

Thr Gly Ala Tyr Asp Thr 
170 

Ala Ser lie lie Lys Pro 
185 

Val Val Gin Lys Ala Ala 
200 

Thr Ser Tyr Val Gin. Cys 
215 

Thr Leu Leu Cys Ser Leu 
230 235 

Ala Gly lie Arg Arg Asp 

•■ 250 

Leu Leu Val Thr Pro His 
265 

Leu Val Arg Gly Val Pro 
280 

Thr Val Val Asn Phe Pro 
295 

Phe Val Gin Met Pro Ala 
310 315 

Leu Asp Thr Arg Thr Leu 
330 

Arg Thr Ser lie Arg Ala 
345 

Gly Arg Asn Met Arg Arg 
36 0 

His Ser Leu Phe Leu Asp 
375 

Thr Asn lie Tyr Lys lie 
390 395 

Cys Val Leu Gin Leu Pro 
410 

Phe Ser Cys Ala Ser Ser 
425 

Glu Ser Gin Glu Arg Arg 
440 

Pro Ser Ala Leu Arg Gly 
455 




Arg Ala Trp Arg Thr 
140 

Pro Glu Leu Tyr Phe 
160 

lie Pro Gin Asp Arg 
175 

Gin Asn Thr Tyr Cys 
190 

Met Gly Thr Ser Ala 
205 

Gin Gly lie Pro Gin 
220 

Cys Tyr Gly Asp Met 
240 

Gly Leu Leu Leu Arg 
255 

Leu Thr His Ala Lys 
270 

Glu Tyr Gly Cys Val 
285 

Val Glu Asp Glu Ala 
300 

His Gly Leu Phe Pro 
320 

Glu Val Gin Ser Asp 
335 

Ser Leu Thr Phe Asn 
350 

Lys Leu Phe Gly Val 
365 

Leu Gin Val Asn Ser 
380 

Leu Leu Leu Gin Ala 
400 

Phe His Gin Gin Val 
415 

Leu Thr Arg Leu Pro 
430 

Asp Val Ala Gly Gly 
445 

Arg Ala Val Ala Val 
460 
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Pro Pro Ser lie Pro Ala Gin Ala Asp Ser Thr Pro Cys His Leu Arg 
465 470 475 480 

Ala Thr Pro Gly Val Thr Gin Asp Ser Pro Asp Ala Ala Glu Ser Glu 
485 490 495 

Ala Pro Gly Asp Asp Ala Asp Cys Pro Gly Gly Arg Ser Gin Pro Gly 
500 505 510 

Thr Ala Leu Arg Leu Gin Asp His Pro Gly Leu Met Ala Thr Arg Pro 
515 520 525 

Gin Pro Gly Arg Glu Gin Thr Pro Ala Ala Leu Ser Arg Arg Ala Tyr 
530 535 540 

Thr Ser Gin Gly Gly Arg Gly Gly Pro His Pro Gly Leu His Arg Trp 
545 550 555 560 

Glu Ser Glu Ala 

(2) INFORMATION FOR SEQ ID NO: 102: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 102 : 
CCAGTGAGCA GAGTGACGAG GACTCGAGCT CAAGCTTTTT TTTTTTTTTT 50 

(2) INFORMATION FOR SEQ ID NO:103: 

(i) -SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:103: 
CCAGTGAGCA GAGTGACG 18 
(2) INFORMATION FOR SEQ ID NO: 104: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 104: 
GAGGACTCGA GCTCAAGC 18 
(2) INFORMATION FOR SEQ ID NO: 105: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:105: 
CACTGATCCT TTCTTTTTCG TAAACGATAG GT 32 

(2) INFORMATION FOR SEQ ID NO: 106: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 106: 
CATCAATCAA ATCTTCCATA TAGAAATGAC A 31 

(2) INFORMATION FOR SEQ ID NO : 107 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE : 

(A) NAME /KEY : modif ied_base 

(B) LOCATION: 1 

(D) OTHER INFORMATION: /mod__base= OTHER 

/note= "N = 5 ' -phosphorylated guano sine " 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:107: 
NGGCCGTGTT GGCCTAGTTC TCTGCTC 27 



(2) INFORMATION FOR SEQ ID NO:108: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 3 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 108 
GAGGAGGAGA AGAGCAGAGA ACTAGGCCAA CACGCCCC 
(2) INFORMATION FOR SEQ ID NO: 109: 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 109 
GTGTCATTTC TATATGGAAG ATTTGATTGA TG 
(2) INFORMATION FOR SEQ ID NO: 110: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 110 
ACCTATCGTT TACGAAAAAG AAAGGATCAG TG 
(2) INFORMATION FOR SEQ ID NO: 111: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 111 
GAGTGACATA ATATACGTGA 
(2) INFORMATION FOR SEQ ID NO: 112: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 
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.(C) STRANDEDNESS: 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 112: 

Phe Phe Tyr Val Thr Glu Thr Thr Phe Gin Lys Asn Arg Leu Phe Phe 
15 10 15 

Tyr Arg Lys Ser Val Trp Ser Lys 
20 



(2) INFORMATION FOR SEQ ID NO: 113: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 113: 

Arg Gin His Leu Lys Arg Val Gin Leu Arg Asp Val Ser Glu Ala Glu 
1 5 10 15 

Val Arg Gin His Arg Glu Ala 
20 



(2) INFORMATION FOR SEQ ID NO: 114: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 114: 

Ala Arg Thr Phe Arg Arg Glu Lys Arg Ala Glu Arg Leu Thr Ser Arg 
1 5 10 15 

Val Lys Ala Leu Phe Ser Val Leu Asn Tyr Glu 
20 25 



(2) INFORMATION FOR SEQ ID NO: 115: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 



81 



(ii) MOLECULE TYPE : peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:115: 

Ala Lys Phe Leu His Trp Leu Met Ser Val Tyr Val Val Glu Leu Leu 
1 5 10 15 

Arg Ser Phe Phe Tyr Val Thr Glu Thr Thr Phe Gin 
20 25 

(2) INFORMATION FOR SEQ ID NO: 116: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 0 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:116: 

Leu Phe Phe Tyr Arg Lys Ser Val Trp Ser Lys Leu Gin Ser lie Gly 
1 5 10 15 

lie Arg Gin His Leu Lys Arg Val Gin Leu Arg Asp Val Ser 
20 25 30 

(2) INFORMATION FOR SEQ ID NO: 117: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 27 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 117 : 

Pro Ala Leu Leu Thr Ser, Arg Leu Arg Phe lie Pro Lys Pro Asp Gly 
1 5 10 15 

Leu Arg Pro lie Val Asn Met Asp Tyr Val Val 
20 25 

(2) INFORMATION FOR SEQ ID NO: 118: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 118: 



YARACHAARG GHATYCCHYA RGG 



23 



(2) INFORMATION FOR SEQ ID NO: 11 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 119: 

Gin Thr Lys Gly lie Pro Gin Gly 
1 5 



(2) INFORMATION FOR SEQ ID NO: 120: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 0: 
NGTNATDARD ARRTARTCRT C 21 



(2) INFORMATION FOR SEQ ID NO: 121: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 121: 

Asp Asp Tyr Leu Leu lie Thr 
1 5 



(2) INFORMATION FOR SEQ ID NO: 122: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 122: 

Lys Gly lie Pro Gin Gly Leu Cys Val Ser Ser lie Leu Ser Ser Phe 
1 5 10 15 

Tyr Tyr Ala Thr Leu Glu Glu Ser Ser Leu Gly Phe Leu Arg Asp Glu 
20 25 30 

Ser Met Asn Pro Glu Asn Pro Asn Val Asn Leu Leu Met Arg Leu Thr 
35 40 45 

Asp Asp Tyr Leu Leu lie Thr 
50 55 

(2) INFORMATION FOR SEQ ID NO: 123: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 123: 

Ser lie Leu Ser Ser Phe Leu Cys His Phe Tyr Met Glu Asp Leu lie 
1 5 10 15 

Asp Glu Tyr Leu Ser Phe Thr Lys Lys Lys Gly Ser Val Leu Leu Arg 
20 25 30 

Val Val 

(2) INFORMATION FOR SEQ ID NO: 124: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 49. amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:124: 

Asp Gly Leu Phe Gin Gly Ser Ser Leu Ser Ala Pro lie Val Asp Leu 
1 5 10 15 

Val Tyr Asp Asp Leu Leu Glu Phe Tyr Ser Glu Phe Lys Ala Ser Pro 
20 25 30 

Ser Gin Asp Thr -Leu lie Leu Lys Leu Ala Asp Asp Phe Leu lie lie 
35 40 45 

Ser 



84 



(2) INFORMATION FOR SEQ ID NO:125: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 125: 

Gln.Lys Val Gly lie Pro Gin Gly 
1 5 



(2) INFORMATION FOR SEQ ID NO: 126: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

'( 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 126: 
CAAAAAGTTG GTATCCCTCA GGG 
(2) INFORMATION FOR SEQ ID NO: 127: 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 146 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 127: 
AGAC CAAAGG AATTC CATCA GGCTCAATTC TGTCATCTTT TTTGTGTCAT TTCTATATGG 
AAGATTTGAT TGATGAATAC CTATCGTTTA CGAAAAAGAA AGGATCAGTG TTGTTACGAG 
TAGTCGACGA CTACCTCCTC ATCACC 



(2) INFORMATION FOR SEQ ID NO: 128: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
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(xi) 



SEQUENCE DESCRIPTION: SEQ ID NO:128; 



Lys 
1 



Gly He Pro Ser Gly Ser He Leu Ser Ser Phe Leu Cys His Phe 
5 10 15 



Tyr 



Met Glu Asp Leu He Asp Glu Tyr Leu Ser Phe Thr Lys Lys Lys 
20 25 30 



Gly 



Ser Val Leu Leu Arg Val Val Asp Asp Tyr Leu Leu He Thr 
35 40 45 



(2) INFORMATION FOR SEQ ID NO: 129: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

( i i ) MOLECULE TYPE : DNA ( genomi c ) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 9: 
GACGATTTCC TCTTTATAAC A 21 



(2) INFORMATION FOR SEQ ID NO: 130: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:130: 

Asp Asp Phe Leu Phe He Thr 
1 5 



(2) INFORMATION FOR SEQ ID NO:131: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 131: 
AAAAAAAAAA AAAAAA 16 
(2) INFORMATION FOR SEQ ID NO : 1 3 2 : 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO:132: 
TTTTTTTTTT TTTTTTT 

(2) INFORMATION FOR SEQ ID NO: 13 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME /KEY: Peptide 

(B) LOCATION: 1. .35 

(D) OTHER INFORMATION: /note= "motif 0 peptide from 

Schizosaccharomyces pombe tezlp" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 133: 

Trp Leu Tyr Asn Ser Phe lie lie Pro lie Leu Gin Ser Phe Phe Tyr 
15 10 15 

lie Thr Glu Ser Ser Asp Leu Arg Asn Arg Thr Val Tyr Phe Arg Lys 
20 25 30 

Asp lie Trp 
35 



(2) INFORMATION FOR SEQ ID NO: 134: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME /KEY: Peptide 

(B) LOCATION: 1..22 

(D) OTHER INFORMATION: /note= "motif 1 and 2 peptide from 

Schizosaccharomyces pombe tezlp" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 134: 

Ala Val lie Arg Leu Leu Pro Lys Lys Asn Thr Phe Arg Leu lie Thr 
15 10 15 
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Asn Leu Arg Lys Arg Phe 
20 

(2) INFORMATION FOR SEQ ID NO: 135: 

( i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 26 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME /KEY: Peptide 
■(B) LOCATION: 1..26 

(D) OTHER INFORMATION: /note= "motif 3(A) peptide from 

Schizosaccharomyces pombe tezlp" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:135: 

Lys Lys Tyr Phe Val Arg lie Asp lie Lys Ser Cys Tyr Asp Arg He 
1 5 10 15 

Lys Gin Asp Leu Met Phe Arg He Val Lys 
20 25 

(2) INFORMATION FOR SEQ ID NO: 136: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 32 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME /KEY: Peptide 

(B) LOCATION: 1..32 

(D) OTHER INFORMATION: /note- "motif 4 (B' ) peptide from 

Schizosaccharomyces pombe tezlp" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 136: 

Tyr Leu Gin Lys Val Gly He Pro Gin Gly Ser He Leu Ser Ser Phe 
1 5 10 15 

Leu Cys His Phe Tyr Met Glu Asp Leu He Asp Glu Tyr Leu Ser Phe 
20 25 30 

(2) INFORMATION FOR SEQ ID NO: 137: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 9 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE : 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1. .49 

(D) OTHER INFORMATION: /note- "motif 5(C) and 6(D) peptide from 

Schizosaccharomyces pombe tezlp" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:137: 

Val Leu Leu Arg Val Val Asp Asp Phe Leu Phe lie Thr Val Asn Lys 
1 5 10 15 

Lys Asp Ala Lys Lys Phe Leu Asn Leu Ser Leu Arg Gly Phe Glu Lys 
20 25 30 

His Asn Phe Ser Thr Ser Leu Glu Lys Thr Val lie Asn Phe Glu Asn 
35 40 45 

Ser 



(2) INFORMATION FOR SEQ ID NO: 138: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 34 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 
'(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: . 

(A) NAME /KEY : Peptide 

(B) LOCATION: 1 . • 34 

(D) OTHER INFORMATION: /note- "motif 0 peptide from 

Saccharomyces cerevisiae EST2p" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:138: 

Trp Leu Phe Arg Gin Leu lie Pro Lys lie lie Gin Thr Phe Phe Tyr 
1 5 10 15 

Cys Thr Glu lie Ser Ser Thr Val Thr lie Val Tyr Phe Arg His Asp 
20 25 30 

Thr Trp 



(2) INFORMATION FOR SEQ ID NO: 139: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 5 amino acids 

(B) TYPE: amino acid 
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(C) STRANDEDNESS : 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME /KEY: Peptide 

(B) LOCATION: 1..25 

(D) OTHER INFORMATION; /note= "motif 1 and 2 peptide from 

Saccharomyces cerevisiae EST2p" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 9: 

Ser Lys Met Arg lie lie Pro Lys Lys Ser Asn Asn Glu Phe Arg lie 
1 5 10 15 

He Ala He Pro Cys Arg Gly Ala Asp 
20 25 

(2) INFORMATION FOR SEQ ID NO: 14 0: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME /KEY: Peptide 

(B) LOCATION: 1. .26 

(D) OTHER INFORMATION: /note= "motif 3(A) peptide from 

Saccharomyces cerevisiae EST2p" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 0: 

Glu Leu Tyr Phe Met Lys Phe Asp Val Lys Ser Cys Tyr Asp Ser He 
1 5 10 15 

Pro Arg Met Glu Cys Met Arg He Leu Lys 
20 25 

(2) INFORMATION FOR SEQ ID NO: 141: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME /KEY: Peptide 
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(B) LOCATION: 1..32 

(D) OTHER INFORMATION: /noter "motif 4 (B' ) peptide from 

Saccharomyces cerevisiae EST2p" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 141: 

Tyr lie Arg Glu Asp Gly Leu Phe Gin Gly Ser Ser Leu Ser Ala Pro 
1 5 10 15 

lie Val Asp Leu Val Tyr Asp Asp Leu Leu Glu Phe Tyr Ser Glu Phe 
20 25 30 

(2) INFORMATION FOR SEQ ID NO: 142: 

{ i ) SEQUENCE CHARACTERISTICS ; 

(A) LENGTH: 49 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME /KEY: Peptide 

(B) LOCATION: 1. .49 

(D) OTHER INFORMATION: /note** "motif 5(C) peptide from 

Saccharomyces cerevisiae EST2p" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 142: 

Leu lie Leu Lys Leu Ala Asp Asp Phe Leu lie lie Ser Thr Asp Gin 
1 5 10 15 

Gin Gin Val He Asn He Lys Lys Leu Ala Met Gly Gly Phe Gin Lys 
20 25 30 

Tyr Asn Ala Lys Ala Asn Arg Asp Lys He Leu Ala Val Ser Ser Gin 
35 40 45 

Ser 

(2) INFORMATION FOR SEQ ID NO: 143: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME /KEY : Peptide 

(B) LOCATION: 1..35 

(D) OTHER INFORMATION: /note* "motif 0 peptide from 

Euplotes aediculatus pl23" 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:143: 

Trp lie Phe Glu Asp Leu Val Val Ser Leu lie Arg Cys Phe Phe Tyr 
1 5 10 15 

Val Thr Glu Gin Gin Lys Ser Tyr Ser Lys Thr Tyr Tyr Tyr Arg Lys 
20 25 30 

Asn lie Trp 
35 



(2) INFORMATION FOR SEQ ID NO: 144: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 amino acids 

(B) ' TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE : 

(A) NAME /KEY: Peptide 

(B) LOCATION: 1. .23 

(D) OTHER INFORMATION: /note= "motif 1 and 2 peptide from 

Euplotes aediculatus pl23" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 144: 

Gly Lys Leu Arg Leu lie Pro Lys Lys Thr Thr Phe Arg Pro lie Met 
1 5 10 15 

Thr Phe Asn Lys Lys lie Val 
20 



(2) INFORMATION FOR SEQ ID NO: 145: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE : 

(A) NAME /KEY : Peptide 

(B) LOCATION: 1. .26 

(D) OTHER INFORMATION: /note= "motif 3(A) peptide from 

Euplotes aediculatus pl23" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 5: 

Lys Leu Phe Phe Ala Thr Met Asp He Glu Lys Cys Tyr Asp Ser Val 
1 5 10 15 
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Asn Arg Glu Lys Leu Ser Thr Phe Leu Lys 
20 25 

(2) INFORMATION FOR SEQ ID NO: 146: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME /KEY : Peptide 

(B) LOCATION: 1..32 

(D) OTHER INFORMATION: /note= "motif 4 (B' ) peptide from 

Euplotes aediculatus pl2 3" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 146: 

Tyr Lys Gin Thr Lys Gly He Pro Gin Gly Leu Cys Val Ser Ser He 
1 5 10 15 

Leu Ser Ser Phe Tyr Tyr Ala Thr Leu Glu Glu Ser Ser Leu Gly Phe 
20 25 30 

(2) INFORMATION FOR SEQ ID NO: 147: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 49 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE : 

(A) NAME /KEY: Peptide 

(B) LOCATION: 1. .49 

(D) OTHER INFORMATION: /note= "motif 5(C) and 6(D) peptide from 

Euplotes aediculatus pl23" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:147: 

Leu Leu Met Arg Leu Thr Asp Asp Tyr Leu Leu He Thr Thr Gin Glu 
1,5 10 15 

Asn Asn Ala Val Leu Phe He Glu Lys Leu He Asn Val Ser Arg Glu 
20 25 30 

Asn Gly Phe Lys Phe Asn Met Lys Lys Leu Gin Thr Ser Phe Pro Leu 
35 40 45 

Ser 
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(2) INFORMATION FOR SEQ ID NO: 148: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



( ix) FEATURE : 

(A) NAME /KEY : Peptide 

(B) LOCATION: 1..22 

(D) OTHER INFORMATION: /note= "motif 1 peptide from 

Euplotes aediculatus pl23" 



(xi) SEQUENCE DESCRIPTION: SEQ ID- NO: 148: 

Leu Val Val Ser Leu lie Arg Cys Phe Phe Tyr Val Thr Glu Gin Gin 
1 5 10 15 

Lys Ser Tyr Ser Lys Thr 
20 



(2) INFORMATION FOR SEQ ID NO: 149: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME /KEY : Peptide 

(B) LOCATION: 1. .30 

(D) OTHER INFORMATION: /note= "motif 0 peptide from 

Euplotes aediculatus pl23" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 149: 

Lys Ser Leu Gly Phe Ala Pro Gly Lys Leu Arg Leu lie Pro Lys Lys 
15 10 15 

Thr Thr Phe Arg Pro lie Met Thr Phe Asn Lys Lys lie Val 
20 25 30 



(2) INFORMATION FOR SEQ ID NO: 150: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 amino acids 

(B) TYPE: amino acid- 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
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<ii) MOLECULE TYPE: peptide 

(ix) FEATURE : 

(A) NAME /KEY : Peptide 

(B) LOCATION: 1..27 

(D) OTHER INFORMATION: /note= "motif A peptide from 

Euplotes aediculatus p!23 " 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 150: 

Pro Lys Leu Phe Phe Ala Thr Met Asp lie Glu.Lys Cys Tyr Asp Ser 
1 5 10 15 

Val Asn Arg Glu Lys Leu Ser Thr Phe Leu Lys 
20 25 

(2) INFORMATION FOR SEQ ID NO: 151: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 2 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME /KEY : Peptide 

(B) LOCATION: 1. .27 

(D) OTHER INFORMATION : /note= "motif B peptide from 

Euplotes aediculatus pl23" 

(xi) SEQUENCE DESCRIPTION: . SEQ ID NO: 151: 

Asn Gly Lys Phe Tyr Lys Gin Thr Lys Gly lie Pro Gin Gly Leu Cys 
1 5 10 15 

Val Ser Ser lie Leu Ser Ser Phe Tyr Tyr Ala 
20 25 

(2) INFORMATION FOR SEQ ID NO: 152: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 22 

■(D) OTHER INFORMATION: /note= "motif C peptide from 

Euplotes aediculatus pl2 3" 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:152: 

Pro Asn Val Asn Leu Leu Met Arg Leu Thr Asp Asp Tyr Leu Leu lie 
1 5 10 15 

Thr Thr Gin Glu Asn Asn 
20 



(2) INFORMATION FOR SEQ ID NO:153: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1. .15 

(D) OTHER INFORMATION: /note= "motif D peptide from 

Euplotes aediculatus pl23 ,r 



(xi) SEQUENCE DESCRIPTION; SEQ ID NO : 153 : 

Asn Val Ser Arg Glu Asn Gly Phe Lys Phe Asn Met Lys Lys Leu 
1 5 10 15 



(2) INFORMATION FOR SEQ ID NO: 154: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME /KEY : Peptide 

(B) LOCATION: 1..22 

(D) OTHER INFORMATION: /note= "motif 1 peptide from 

Schizosaccharomyces pombe tezl" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:154: 

Phe lie lie Pro lie Leu Gin Ser Phe Phe Tyr lie Thr Glu Ser Ser 
1 5 10 15 

Asp Leu Arg Asn Arg Thr 
20 



(2) INFORMATION FOR SEQ ID NO: 155: 
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{ i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 30 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME /KEY : Peptide 

(B) LOCATION: 1..30 

(D) OTHER INFORMATION: /note= "motif .0 peptide from 

Schizosaccharomyces pombe tezl" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 155: 

Gin Lys Thr Thr Leu Pro Pro Ala Val lie Arg Leu Leu Pro Lys Lys 
1 5 10 15 

Asn Thr Phe Arg Leu lie Thr Asn Leu Arg Lys Arg Phe Leu 
20 25 30 

(2) INFORMATION FOR SEQ ID NO:156: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME /KEY: Peptide 

(B) LOCATION: 1. .27 

(D) OTHER INFORMATION: /note= "motif A peptide from 

Schizosaccharomyces pombe tezl" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 156 : 

Arg Lys Lys Tyr Phe Val Arg lie Asp lie Lys Ser Cys Tyr Asp Arg 
1 5 10 15 

He Lys Gin Asp Leu Met Phe Arg He Val Lys 
20 25 

(2) INFORMATION FOR SEQ ID NO: 15 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
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(ix) FEATURE: 

(A) NAME /KEY: Peptide 

(B) LOCATION: 1. .27 

(D) OTHER INFORMATION: /note* "motif B peptide from 

Schizosaccharomyces pombe tezl" 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 157: 

Gly Asn Ser Gin Tyr Leu Gin Lys Val Gly lie Pro Gin Gly Ser lie 
15 10 15 

Leu Ser Ser Phe Leu Cys His Phe Tyr Met Glu 
20 25 



(2) INFORMATION FOR SEQ ID NO:158: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME /KEY: Peptide 

(B) LOCATION: 1..22 

(D) OTHER INFORMATION : /note= "motif C peptide from 

Schizosaccharomyces pombe tezl" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:158: 

Lys Lys Gly Ser Val Leu Leu Arg Val Val Asp Asp Phe Leu Phe lie 
1 5 10 15 

Thr Val Asn Lys Lys Asp 
20 



(2) INFORMATION FOR SEQ ID NO: 159: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

( i i ) MOLECULE TYPE : peptide 



(ix) FEATURE: 

(A) NAME /KEY: Peptide 

(B) LOCATION: 1. .15 

(D) OTHER INFORMATION: /note* "motif D peptide from 

Schizosaccharomyces pombe tezl" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15 9: 
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Leu Asn Leu Ser Leu Arg Gly Phe Glu Lys His Asn Phe Ser Thr 
1.5 10 15 

(2) INFORMATION FOR SEQ ID NO: 160: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 22 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME /KEY: Peptide 

(B) LOCATION: 1. .22 

(D) OTHER INFORMATION: /note= "motif 1 peptide from 

Saccharomyces cerevisiae EST2" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 160: 

Leu lie Pro Lys lie lie Gin Thr Phe Phe Tyr Cys Thr Glu lie Ser 
1 5 10 15 

Ser Thr Val Thr lie Val 
20 

(2) INFORMATION FOR SEQ ID NO: 161: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE : 

(A) NAME /KEY: Peptide 

(B) LOCATION; 1..32 

(D) OTHER INFORMATION: /note= "motif 0 peptide from 

Saccharomyces cerevisiae EST2 " 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 161: 

Thr Leu Ser Asn Phe Asn His Ser Lys Met Arg lie lie Pro Lys Lys 
1 5 10 15 

Ser Asn Asn Glu Phe Arg lie lie Ala lie Pro Cys Arg Gly Ala Asp 
20 25 30 



(2) INFORMATION FOR SEQ ID NO: 162: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 amino acids 

(B) TYPE: amino acid 
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(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME /KEY : Peptide 

(B) LOCATION: 1..2 7 

(D) OTHER INFORMATION: /note= "motif A peptide from 

Saccharomyces cerevisiae EST2" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 162: 

Pro Glu Leu Tyr Phe Met Lys Phe Asp Val Lys Ser Cys Tyr Asp Ser 
1 5 10 15 

lie Pro Arg Met Glu Cys Met Arg lie Leu Lys 
20 25 

(2) INFORMATION FOR SEQ ID NO:163: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME /KEY : Peptide 

(B) LOCATION: 1..27 

(D) OTHER INFORMATION: /note- "motif B peptide from 

Saccharomyces cerevisiae EST2" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:163: 

Glu Asp Lys Cys Tyr lie Arg Glu Asp Gly Leu Phe Gin Gly Ser Ser 
1 5 10 15 

Leu Ser Ala Pro lie Val Asp Leu Val Tyr Asp 
20 25 

(2) INFORMATION FOR SEQ ID NO: 164: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY; linear 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME /KEY : Peptide 
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(B) LOCATION: 1..22 

(D) OTHER INFORMATION: /note= "motif C peptide from 

Saccharomyces cerevisiae EST2" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 164: 

Ser Gin Asp Thr Leu lie Leu Lys Leu Ala Asp Asp Phe Leu lie lie 
1 5 10 15 

Ser Thr Asp Gin Gin Gin 
20 



(2) INFORMATION FOR SEQ ID NO: 165: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE : 

(A) NAME /KEY : Peptide 

(B) LOCATION: 1. .15 

(D) OTHER INFORMATION: /note= "motif D peptide from 

Saccharomyces cerevisiae EST2" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 165: 

Lys Lys Leu Ala Met Gly Gly Phe Gin Lys Tyr Asn Ala Lys Ala 
1 5 10 15 



(2) INFORMATION FOR SEQ ID NO: 166: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE : 

(A) NAME /KEY: Peptide 

(B) LOCATION: 1. .22 

(D) OTHER INFORMATION: /note= "motif 1 peptide from human 

telomerase core protein 1 (TCP1) " 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 166: 

Tyr Val Val Glu Leu Leu Arg Ser Phe Phe Tyr Val Thr Glu Thr Thr 
1 5 10 15 

Phe Gin Lys Asn Arg Leu 
20 
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(2) INFORMATION FOR SEQ ID NO:167: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 
'.(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE : peptide 

(ix) FEATURE: 

(A) NAME /KEY: Peptide 

(B) LOCATION: 1. .30 

(D) OTHER INFORMATION: /note= "motif 0 peptide from human 

telomerase core protein 1 (TCP1) " 

(xi) SEQUENCE DESCRIPTION: SEQ IDNO:167: 

Ala Arg Pro Ala Leu Leu Thr Ser Arg Leu Arg Phe lie Pro Lys Pro 
1 5 % 10 15 

Asp Gly Leu Arg Pro lie Val Asn Met Asp Tyr Val Val Gly 
20 25 30 

(2) INFORMATION FOR SEQ ID NO: 168: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE : 

(A) NAME /KEY : Peptide 

(B) LOCATION: 1..2 7 

(D) OTHER INFORMATION: /note= "motif A peptide from human 

telomerase core protein 1 (TCP1) " 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 168: 

Pro Glu Leu Tyr Phe Val Lys Val Asp Val Thr Gly Ala Tyr Asp Thr 
1 5 10 15 

lie Pro Gin Asp Arg Leu Thr Glu Val lie Ala 
20 25 

(2) INFORMATION FOR SEQ ID. NO: 169: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME /KEY : Protein 

(B) LOCATION: 1..27 

(D) OTHER INFORMATION: /note- "motif B peptide from human 

telomerase core protein 1 (TCP1) " 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:169: 

Arg Ala Thr Ser Tyr Val Gin Cys Gin Gly lie Pro Gin Gly Ser lie 
15 10 15 

Leu Ser Thr Leu Leu Cys Ser Leu Cys Tyr Gly 
20 25 

(2) INFORMATION FOR SEQ ID NO:170: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME /KEY : Peptide 

(B) LOCATION: 1..22 

(D) OTHER INFORMATION: /note= "motif C peptide from human 

telomerase core protein 1 (TCP1) " 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 170: 

Arg Arg Asp Gly Leu Leu Leu Arg Leu Val Asp- Asp Phe Leu Leu Val 
1 5 10 15 

Thr Pro His Leu Thr His 
20 

(2) INFORMATION FOR SEQ ID NO: 171: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1..15 

(D) OTHER INFORMATION: /note= "motif D peptide from human 

telomerase core protein 1 (TCP1) " 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 171: 

Leu Arg Thr Leu Val Arg Gly Val Pro Glu Tyr Gly Cys Val Val 
1 5 10 15 
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